Search Results for author: Nan Yang

Found 85 papers, 29 papers with code

Unified Language Model Pre-training for Natural Language Understanding and Generation

9 code implementations • NeurIPS 2019 • Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

This paper presents a new Unified pre-trained Language Model (UniLM) that can be fine-tuned for both natural language understanding and generation tasks.

Ranked #2 on Generative Question Answering on CoQA (using extra training data)

Abstractive Text Summarization Document Summarization +7

18,284

Paper
Code

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

1 code implementation • NeurIPS 2020 • Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, Ming Zhou

The small model (student) is trained by deeply mimicking the self-attention module, which plays a vital role in Transformer networks, of the large model (teacher).

Ranked #8 on Zero-shot Text Search on BEIR

Zero-shot Text Search

18,284

Paper
Code

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

3 code implementations • 28 Feb 2020 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Songhao Piao, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM).

Ranked #4 on Question Generation on SQuAD1.1 (using extra training data)

Abstractive Text Summarization Language Modelling +3

18,284

Paper
Code

Pseudo-Masked Language Models for Unified Language Model Pre-Training

1 code implementation • ICML 2020 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Jianfeng Gao, Songhao Piao, Ming Zhou, Hsiao-Wuen Hon

Language Modelling Natural Language Understanding +1

18,284

Paper
Code

s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning

1 code implementation • 26 Oct 2021 • Hangbo Bao, Li Dong, Wenhui Wang, Nan Yang, Furu Wei

Pretrained bidirectional Transformers, such as BERT, have achieved significant improvements in a wide variety of language understanding tasks, while it is not straightforward to directly apply them for natural language generation.

Abstractive Text Summarization Question Generation +2

18,284

Paper
Code

SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval

1 code implementation • 6 Jul 2022 • Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei

It employs a simple bottleneck architecture that learns to compress the passage information into a dense vector through self-supervised pre-training.

Language Modelling Passage Retrieval +1

18,284

Paper
Code

Text Embeddings by Weakly-Supervised Contrastive Pre-training

1 code implementation • 7 Dec 2022 • Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei

This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a wide range of tasks.

Ranked #11 on Only Connect Walls Dataset Task 1 (Grouping) on OCW (using extra training data)

Only Connect Walls Dataset Task 1 (Grouping) Retrieval

18,284

Paper
Code

Multilingual E5 Text Embeddings: A Technical Report

1 code implementation • 8 Feb 2024 • Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

This technical report presents the training methodology and evaluation results of the open-source multilingual E5 text embedding models, released in mid-2023.

18,284

Paper
Code

InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training

4 code implementations • NAACL 2021 • Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, He-Yan Huang, Ming Zhou

In this work, we present an information-theoretic framework that formulates cross-lingual language model pre-training as maximizing mutual information between multilingual-multi-granularity texts.

Ranked #16 on Zero-Shot Cross-Lingual Transfer on XTREME

Contrastive Learning Cross-Lingual Transfer +2

18,283

Paper
Code

Inference with Reference: Lossless Acceleration of Large Language Models

1 code implementation • 10 Apr 2023 • Nan Yang, Tao Ge, Liang Wang, Binxing Jiao, Daxin Jiang, Linjun Yang, Rangan Majumder, Furu Wei

We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references.

Language Modelling Large Language Model

3,167

Paper
Code

Learning to Retrieve In-Context Examples for Large Language Models

2 code implementations • 14 Jul 2023 • Liang Wang, Nan Yang, Furu Wei

Our framework initially trains a reward model based on LLM feedback to evaluate the quality of candidate examples, followed by knowledge distillation to train a bi-encoder based dense retriever.

In-Context Learning Knowledge Distillation

3,167

Paper
Code

TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

1 code implementation • 14 Nov 2021 • Lukas Koestler, Nan Yang, Niclas Zeller, Daniel Cremers

In this paper, we present TANDEM a real-time monocular tracking and dense mapping framework.

3D Reconstruction Monocular Visual Odometry +1

901

Paper
Code

Generative Representational Instruction Tuning

2 code implementations • 15 Feb 2024 • Niklas Muennighoff, Hongjin Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh, Douwe Kiela

Notably, we find that GRIT matches training on only generative or embedding data, thus we can unify both at no performance loss.

Language Modelling Large Language Model +1

806

Paper
Code

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

1 code implementation • CVPR 2021 • Felix Wimbauer, Nan Yang, Lukas von Stumberg, Niclas Zeller, Daniel Cremers

Unlike other multi-view stereo methods, MonoRec is able to reconstruct both static and moving objects by leveraging the predicted masks.

570

Paper
Code

Fine-Tuning LLaMA for Multi-Stage Text Retrieval

1 code implementation • 12 Oct 2023 • Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin

Our findings demonstrate that the effectiveness of large language models indeed surpasses that of smaller models.

Passage Retrieval Retrieval +1

384

Paper
Code

Behind the Scenes: Density Fields for Single View Reconstruction

2 code implementations • CVPR 2023 • Felix Wimbauer, Nan Yang, Christian Rupprecht, Daniel Cremers

By directly sampling color from the available views instead of storing color in the density field, our scene representation becomes significantly less complex compared to NeRFs, and a neural network can predict it in a single forward pass.

Depth Estimation Depth Prediction +1

227

Paper
Code

Neural Document Summarization by Jointly Learning to Score and Select Sentences

1 code implementation • ACL 2018 • Qingyu Zhou, Nan Yang, Furu Wei, Shaohan Huang, Ming Zhou, Tiejun Zhao

In this paper, we present a novel end-to-end neural network framework for extractive document summarization by jointly learning to score and select sentences.

Ranked #9 on Extractive Text Summarization on CNN / Daily Mail

Document Summarization Extractive Document Summarization +3

150

Paper
Code

Neural Question Generation from Text: A Preliminary Study

6 code implementations • 6 Apr 2017 • Qingyu Zhou, Nan Yang, Furu Wei, Chuanqi Tan, Hangbo Bao, Ming Zhou

Automatic question generation aims to generate questions from a text passage where the generated questions can be answered by certain sub-spans of the given passage.

Ranked #13 on Question Generation on SQuAD1.1

Position Question Generation +2

142

Paper
Code

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

1 code implementation • 19 Sep 2023 • Dawei Zhu, Nan Yang, Liang Wang, YiFan Song, Wenhao Wu, Furu Wei, Sujian Li

To decouple train length from target length for efficient context window extension, we propose Positional Skip-wisE (PoSE) training that smartly simulates long inputs using a fixed context window.

2k Position

139

Paper
Code

Improving Text Embeddings with Large Language Models

1 code implementation • 31 Dec 2023 • Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

In this paper, we introduce a novel and simple method for obtaining high-quality text embeddings using only synthetic data and less than 1k training steps.

Paper
Code

Selective Encoding for Abstractive Sentence Summarization

2 code implementations • ACL 2017 • Qingyu Zhou, Nan Yang, Furu Wei, Ming Zhou

We propose a selective encoding model to extend the sequence-to-sequence framework for abstractive sentence summarization.

Ranked #8 on Text Summarization on DUC 2004 Task 1

Sentence Sentence Summarization

Paper
Code

Multiview Identifiers Enhanced Generative Retrieval

1 code implementation • 26 May 2023 • Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

Instead of simply matching a query to pre-existing passages, generative retrieval generates identifier strings of passages as the retrieval target.

Retrieval

Paper
Code

Learning to Rank in Generative Retrieval

2 code implementations • 27 Jun 2023 • Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

However, only learning to generate is insufficient for generative retrieval.

Learning-To-Rank Passage Ranking +3

Paper
Code

Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification

1 code implementation • ACL 2014 • Duyu Tang, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, Bing Qin

Classification Feature Engineering +4

Paper
Code

Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval

1 code implementation • 8 Aug 2022 • Zehan Li, Nan Yang, Liang Wang, Furu Wei

In this paper, we propose a new dense retrieval model which learns diverse document representations with deep query interactions.

Retrieval

Paper
Code

Confidence Estimation Transformer for Long-term Renewable Energy Forecasting in Reinforcement Learning-based Power Grid Dispatching

1 code implementation • 10 Apr 2022 • Xinhang Li, Zihao Li, Nan Yang, Zheng Yuan, Qinwen Wang, Yiying Yang, Yupeng Huang, Xuri Song, Lei LI, Lin Zhang

The expansion of renewable energy could help realizing the goals of peaking carbon dioxide emissions and carbon neutralization.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

An Efficient Method for Detecting Asphalt Pavement Cracks and Sealed Cracks Based on a Deep Data-Driven Model

1 code implementation • journal 2022 • Nan Yang, Yongshang Li, Ronggui Ma

Second, we develop a dense and redundant crack annotation method based on the characteristics of the crack images.

object-detection Object Detection

Paper
Code

LongEmbed: Extending Embedding Models for Long Context Retrieval

1 code implementation • 18 Apr 2024 • Dawei Zhu, Liang Wang, Nan Yang, YiFan Song, Wenhao Wu, Furu Wei, Sujian Li

This paper explores context window extension of existing embedding models, pushing the limit to 32k without requiring additional training.

4k 8k +1

Paper
Code

Sequential Copying Networks

1 code implementation • 6 Jul 2018 • Qingyu Zhou, Nan Yang, Furu Wei, Ming Zhou

Copying mechanism shows effectiveness in sequence-to-sequence based neural network models for text generation tasks, such as abstractive sentence summarization and question generation.

Question Generation Question-Generation +3

Paper
Code

Challenges in Monocular Visual Odometry: Photometric Calibration, Motion Bias and Rolling Shutter Effect

no code implementations • 11 May 2017 • Nan Yang, Rui Wang, Xiang Gao, Daniel Cremers

Monocular visual odometry (VO) and simultaneous localization and mapping (SLAM) have seen tremendous improvements in accuracy, robustness and efficiency, and have gained increasing popularity over recent years.

Monocular Visual Odometry Simultaneous Localization and Mapping

Paper
Add Code

Relaxed Wasserstein with Applications to GANs

no code implementations • 19 May 2017 • Xin Guo, Johnny Hong, Tianyi Lin, Nan Yang

Wasserstein Generative Adversarial Networks (WGANs) provide a versatile class of models, which have attracted great attention in various applications.

Image Generation

Paper
Add Code

S-Net: From Answer Extraction to Answer Generation for Machine Reading Comprehension

no code implementations • 15 Jun 2017 • Chuanqi Tan, Furu Wei, Nan Yang, Bowen Du, Weifeng Lv, Ming Zhou

We build the answer extraction model with state-of-the-art neural networks for single passage reading comprehension, and propose an additional task of passage ranking to help answer extraction in multiple passages.

Answer Generation Machine Reading Comprehension +1

Paper
Add Code

Ambiguity set and learning via Bregman and Wasserstein

no code implementations • 23 May 2017 • Xin Guo, Johnny Hong, Nan Yang

Construction of ambiguity set in robust optimization relies on the choice of divergences between probability distributions.

BIG-bench Machine Learning

Paper
Add Code

Brains and pseudorandom generators

no code implementations • 26 Nov 2013 • Vašek Chvátal, Mark Goldsmith, Nan Yang

In a pioneering classic, Warren McCulloch and Walter Pitts proposed a model of the central nervous system; motivated by EEG recordings of normal brain activity, Chv\' atal and Goldsmith asked whether or not this model can be engineered to provide pseudorandom number generators.

EEG

Paper
Add Code

Jointly Modeling Topics and Intents with Global Order Structure

no code implementations • 7 Dec 2015 • Bei Chen, Jun Zhu, Nan Yang, Tian Tian, Ming Zhou, Bo Zhang

Modeling document structure is of great importance for discourse analysis and related applications.

Paper
Add Code

Radical-Enhanced Chinese Character Embedding

no code implementations • 18 Apr 2014 • Yaming Sun, Lei Lin, Duyu Tang, Nan Yang, Zhenzhou Ji, Xiaolong Wang

We present a method to leverage radical for learning Chinese character embedding.

Chinese Word Segmentation

Paper
Add Code

Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse Odometry

no code implementations • ECCV 2018 • Nan Yang, Rui Wang, Jörg Stückler, Daniel Cremers

To this end, we incorporate deep depth predictions into Direct Sparse Odometry (DSO) as direct virtual stereo measurements.

3D Reconstruction Depth Estimation +3

Paper
Add Code

Read + Verify: Machine Reading Comprehension with Unanswerable Questions

no code implementations • 17 Aug 2018 • Minghao Hu, Furu Wei, Yuxing Peng, Zhen Huang, Nan Yang, Dongsheng Li

Machine reading comprehension with unanswerable questions aims to abstain from answering when no answer can be inferred.

Ranked #11 on Question Answering on SQuAD2.0 dev

Machine Reading Comprehension Question Answering

Paper
Add Code

Attention-Guided Answer Distillation for Machine Reading Comprehension

no code implementations • EMNLP 2018 • Minghao Hu, Yuxing Peng, Furu Wei, Zhen Huang, Dongsheng Li, Nan Yang, Ming Zhou

Despite that current reading comprehension systems have achieved significant advancements, their promising performances are often obtained at the cost of making an ensemble of numerous models.

Knowledge Distillation Machine Reading Comprehension

Paper
Add Code

A Disease Diagnosis and Treatment Recommendation System Based on Big Data Mining and Cloud Computing

no code implementations • 17 Oct 2018 • Jianguo Chen, Kenli Li, Huigui Rong, Kashif Bilal, Nan Yang, Keqin Li

It is crucial to provide compatible treatment schemes for a disease according to various symptoms at different stages.

Cloud Computing Clustering

Paper
Add Code

Gated Self-Matching Networks for Reading Comprehension and Question Answering

no code implementations • ACL 2017 • Wenhui Wang, Nan Yang, Furu Wei, Baobao Chang, Ming Zhou

We first match the question and passage with gated attention-based recurrent networks to obtain the question-aware passage representation.

Ranked #35 on Question Answering on SQuAD1.1 dev

Question Answering Reading Comprehension

Paper
Add Code

Sequence-to-Dependency Neural Machine Translation

no code implementations • ACL 2017 • Shuangzhi Wu, Dong-dong Zhang, Nan Yang, Mu Li, Ming Zhou

Nowadays a typical Neural Machine Translation (NMT) model generates translations from left to right as a linear sequence, during which latent syntactic structures of the target sentences are not explicitly concerned.

Machine Translation NMT +1

Paper
Add Code

Improving Attention Modeling with Implicit Distortion and Fertility for Machine Translation

no code implementations • COLING 2016 • Shi Feng, Shujie Liu, Nan Yang, Mu Li, Ming Zhou, Kenny Q. Zhu

In neural machine translation, the attention mechanism facilitates the translation process by producing a soft alignment between the source sentence and the target sentence.

Machine Translation Sentence +1

Paper
Add Code

A Recursive Recurrent Neural Network for Statistical Machine Translation

no code implementations • ACL 2014 • Shujie Liu, Nan Yang, Mu Li, Ming Zhou

Chunking Language Modelling +7

Paper
Add Code

Word Alignment Modeling with Context Dependent Deep Neural Network

no code implementations • ACL 2013 • Nan Yang, Shujie Liu, Mu Li, Ming Zhou, Nenghai Yu

Speech Recognition Word Alignment

Paper
Add Code

Punctuation Prediction with Transition-based Parsing

no code implementations • ACL 2013 • Dong-dong Zhang, Shuangzhi Wu, Nan Yang, Mu Li

Language Modelling Machine Translation +3

Paper
Add Code

Easy-First POS Tagging and Dependency Parsing with Beam Search

no code implementations • ACL 2013 • Ji Ma, Jingbo Zhu, Tong Xiao, Nan Yang

Dependency Parsing POS +1

Paper
Add Code

A Ranking-based Approach to Word Reordering for Statistical Machine Translation

no code implementations • ACL 2012 • Nan Yang, Mu Li, Dong-dong Zhang, Nenghai Yu

Machine Translation Translation

Paper
Add Code

DirectShape: Direct Photometric Alignment of Shape Priors for Visual Vehicle Pose and Shape Estimation

no code implementations • 22 Apr 2019 • Rui Wang, Nan Yang, Joerg Stueckler, Daniel Cremers

Scene understanding from images is a challenging problem encountered in autonomous driving.

3D Object Detection Autonomous Driving +2

Paper
Add Code

Multi-Frame GAN: Image Enhancement for Stereo Visual Odometry in Low Light

no code implementations • 15 Oct 2019 • Eunah Jung, Nan Yang, Daniel Cremers

We propose the concept of a multi-frame GAN (MFGAN) and demonstrate its potential as an image sequence enhancement for stereo visual odometry in low light conditions.

Image Enhancement Optical Flow Estimation +2

Paper
Add Code

Inspecting Unification of Encoding and Matching with Transformer: A Case Study of Machine Reading Comprehension

no code implementations • WS 2019 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Lei Cui, Songhao Piao, Ming Zhou

Most machine reading comprehension (MRC) models separately handle encoding and matching with different network architectures.

Machine Reading Comprehension

Paper
Add Code

D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry

no code implementations • CVPR 2020 • Nan Yang, Lukas von Stumberg, Rui Wang, Daniel Cremers

We propose D3VO as a novel framework for monocular visual odometry that exploits deep networks on three levels -- deep depth, pose and uncertainty estimation.

Monocular Depth Estimation Monocular Visual Odometry

Paper
Add Code

4Seasons: A Cross-Season Dataset for Multi-Weather SLAM in Autonomous Driving

no code implementations • 14 Sep 2020 • Patrick Wenzel, Rui Wang, Nan Yang, Qing Cheng, Qadeer Khan, Lukas von Stumberg, Niclas Zeller, Daniel Cremers

We present a novel dataset covering seasonal and challenging perceptual conditions for autonomous driving.

Autonomous Driving Visual Odometry

Paper
Add Code

LM-Reloc: Levenberg-Marquardt Based Direct Visual Relocalization

no code implementations • 13 Oct 2020 • Lukas von Stumberg, Patrick Wenzel, Nan Yang, Daniel Cremers

The learned features significantly improve the robustness of direct image alignment, especially for relocalization across different conditions.

Pose Estimation

Paper
Add Code

Coverage Analysis for 3D Terahertz Communication Systems with Blockage and Directional Antennas

no code implementations • 16 Apr 2020 • Akram Shafie, Nan Yang, Zhuo Sun, Salman Durrani

We further show that the coverage performance improvement brought by increasing the antenna directivity at APs is higher than that brought by increasing the antenna directivity at UEs.

Paper
Add Code

Expected Density of Cooperative Bacteria in a 2D Quorum Sensing Based Molecular Communication System

no code implementations • 1 Dec 2018 • Yuting Fang, Adam Noel, Andrew W. Eckford, Nan Yang

The number of molecules observed at each randomly-distributed bacterium is first derived by characterizing the diffusion and degradation of signaling molecules within the population.

Paper
Add Code

Hybrid Beamforming for Terahertz Multi-Carrier Systems over Frequency Selective Fading

no code implementations • 14 Oct 2019 • Hang Yuan, Nan Yang, Kai Yang, Chong Han, Jianping An

We consider a three-dimensional wideband THz channel by incorporating the joint effect of molecular absorption, high sparsity, and multi-path fading, and consider the carrier frequency offset in multi-carrier systems.

Paper
Add Code

Directional Modulation-Enabled Secure Transmission with Intelligent Reflecting Surface

no code implementations • 7 Jul 2020 • Liangling Lai, Jinsong Hu, Youjia Chen, Haifeng Zheng, Nan Yang

We propose a new secure transmission scheme which uses directional modulation (DM) with artificial noise and is aided by the intelligent reflecting surface (IRS).

Position

Paper
Add Code

Coverage Analysis for 3D Terahertz Communication Systems

no code implementations • 20 Apr 2021 • Akram Shafie, Nan Yang, Salman Durrani, Xiangyun Zhou, Chong Han, Markku Juntti

We conduct novel coverage probability analysis of downlink transmission in a three-dimensional (3D) terahertz (THz) communication (THzCom) system.

Paper
Add Code

SINGA-Easy: An Easy-to-Use Framework for MultiModal Analysis

no code implementations • 3 Aug 2021 • Naili Xing, Sai Ho Yeung, ChengHao Cai, Teck Khim Ng, Wei Wang, Kaiyuan Yang, Nan Yang, Meihui Zhang, Gang Chen, Beng Chin Ooi

Specifically, in terms of usability, it is demanding for non-experts to implement deep learning models, obtain the right settings for the entire machine learning pipeline, manage models and datasets, and exploit external data sources all together.

Image Classification

Paper
Add Code

xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering

no code implementations • ACL 2021 • Nan Yang, Furu Wei, Binxing Jiao, Daxing Jiang, Linjun Yang

Dense passage retrieval has been shown to be an effective approach for information retrieval tasks such as open domain question answering.

Contrastive Learning Open-Domain Question Answering +2

Paper
Add Code

Spectrum Allocation with Adaptive Sub-band Bandwidth for Terahertz Communication Systems

no code implementations • 10 Nov 2021 • Akram Shafie, Nan Yang, Sheeraz Alvi, Chong Han, Salman Durrani, Josep M. Jornet

Aided by numerical results, we show that by enabling and optimizing ASB, significantly higher throughput can be achieved as compared to adopting equal sub-band bandwidth, and this throughput gain is most profound when the power budget constraint is more stringent.

Paper
Add Code

FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions

no code implementations • 30 Mar 2022 • Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers

To this end, we learn a signed distance function (SDF) along with our DDF model to represent a class of shapes.

3D Reconstruction Inverse Rendering

Paper
Add Code

Novel Spectrum Allocation Among Multiple Transmission Windows for Terahertz Communication Systems

no code implementations • 6 Jul 2022 • Akram Shafie, Nan Yang, Chong Han, Josep M. Jornet

We also show that a further data rate gain can be obtained by optimally determining the unused spectra at the edges of TWs, as compared to avoiding using pre-defined spectra at the edges of TWs.

Paper
Add Code

Terahertz Communications for 6G and Beyond Wireless Networks: Challenges, Key Advancements, and Opportunities

no code implementations • 22 Jul 2022 • Akram Shafie, Nan Yang, Chong Han, Josep Miquel Jornet, Markku Juntti, Thomas Kurner

The unprecedented increase in wireless data traffic, predicted to occur within the next decade, is motivating academia and industries to look beyond contemporary wireless standards and conceptualize the sixth-generation (6G) wireless networks.

Management

Paper
Add Code

An Unsupervised Learning Approach for Spectrum Allocation in Terahertz Communication Systems

no code implementations • 7 Aug 2022 • Akram Shafie, Chunhui Li, Nan Yang, Xiangyun Zhou, Trung Q. Duong

Numerical results demonstrate that comparing to existing approaches, our proposed unsupervised learning-based approach achieves a higher data rate, especially when the molecular absorption coefficient within the spectrum of interest varies in a highly non-linear manner.

Paper
Add Code

CCR: Facial Image Editing with Continuity, Consistency and Reversibility

no code implementations • 22 Sep 2022 • Nan Yang, Xin Luan, Huidi Jia, Zhi Han, Yandong Tang

In this work, we put forward three concepts and corresponding definitions: editing continuity, consistency, and reversibility.

Attribute

Paper
Add Code

Adversarial Transformer for Repairing Human Airway Segmentation

no code implementations • 21 Oct 2022 • Zeyu Tang, Nan Yang, Simon Walsh, Guang Yang

Discontinuity in the delineation of peripheral bronchioles hinders the potential clinical application of automated airway segmentation models.

Segmentation

Paper
Add Code

Terahertz Communications for Massive Connectivity and Security in 6G and Beyond Era

no code implementations • 25 Oct 2022 • Nan Yang, Akram Shafie

Terahertz (THz) communications (THzCom) has experienced a meteoric rise of interest, due to its benefits for ultra-high data rate transmission in the sixth generation (6G) and beyond era.

Management

Paper
Add Code

Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset

no code implementations • 2 Nov 2022 • Haolin Deng, Yanan Zhang, Yangfan Zhang, Wangyang Ying, Changlong Yu, Jun Gao, Wei Wang, Xiaoling Bai, Nan Yang, Jin Ma, Xiang Chen, Tianhua Zhou

To the best of our knowledge, it is currently the largest manually-annotated Chinese dataset for open event extraction.

Benchmarking Event Extraction +2

Paper
Add Code

4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions

no code implementations • 31 Dec 2022 • Patrick Wenzel, Nan Yang, Rui Wang, Niclas Zeller, Daniel Cremers

In this paper, we present a novel visual SLAM and long-term localization benchmark for autonomous driving in challenging conditions based on the large-scale 4Seasons dataset.

Autonomous Driving Benchmarking +2

Paper
Add Code

Random Padding Data Augmentation

no code implementations • 17 Feb 2023 • Nan Yang, Laicheng Zhong, Fan Huang, Dong Yuan, Wei Bao

Random Padding is parameter-free, simple to construct, and compatible with the majority of CNN-based recognition models.

Data Augmentation Image Classification +1

Paper
Add Code

FedIL: Federated Incremental Learning from Decentralized Unlabeled Data with Convergence Analysis

no code implementations • 23 Feb 2023 • Nan Yang, Dong Yuan, Charles Z Liu, Yongkun Deng, Wei Bao

Most existing federated learning methods assume that clients have fully labeled data to train on, while in reality, it is hard for the clients to get task-specific labels due to users' privacy concerns, high labeling costs, or lack of expertise.

Federated Learning Incremental Learning +1

Paper
Add Code

Real-time scheduling of renewable power systems through planning-based reinforcement learning

no code implementations • 9 Mar 2023 • Shaohuai Liu, Jinbo Liu, Weirui Ye, Nan Yang, Guanglun Zhang, Haiwang Zhong, Chongqing Kang, Qirong Jiang, Xuri Song, Fangchun Di, Yang Gao

The well-trained scheduling agent significantly reduces renewable curtailment and load shedding, which are issues arising from traditional scheduling's reliance on inaccurate day-ahead forecasts.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Query2doc: Query Expansion with Large Language Models

no code implementations • 14 Mar 2023 • Liang Wang, Nan Yang, Furu Wei

This paper introduces a simple yet effective query expansion approach, denoted as query2doc, to improve both sparse and dense retrieval systems.

Memorization Retrieval

Paper
Add Code

FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder

no code implementations • 20 Mar 2023 • Nan Yang, Xuanyu Chen, Charles Z. Liu, Dong Yuan, Wei Bao, Lizhen Cui

Latest federated learning (FL) methods started to focus on how to use unlabeled data in clients for training due to users' privacy concerns, high labeling costs, or lack of expertise.

Federated Learning Image Reconstruction +1

Paper
Add Code

Combining Adversaries with Anti-adversaries in Training

no code implementations • 25 Apr 2023 • Xiaoling Zhou, Nan Yang, Ou wu

On the basis of our theoretical findings, a more general learning objective that combines adversaries and anti-adversaries with varied bounds on each training sample is presented.

Fairness Meta-Learning

Paper
Add Code

UAV-assisted IoT Monitoring Network: Adaptive Multiuser Access for Low-Latency and High-Reliability Under Bursty Traffic

no code implementations • 25 Apr 2023 • Nilupuli Senadhira, Salman Durrani, Sheeraz A. Alvi, Nan Yang, Xiangyun Zhou

In this work, we propose an adaptive system design for an Internet of Things (IoT) monitoring network with latency and reliability requirements, where IoT devices generate time-critical and event-triggered bursty traffic, and an unmanned aerial vehicle (UAV) aggregates and relays sensed data to the base station.

Paper
Add Code

Incremental Dense Reconstruction from Monocular Video with Guided Sparse Feature Volume Fusion

no code implementations • 24 May 2023 • Xingxing Zuo, Nan Yang, Nathaniel Merrill, Binbin Xu, Stefan Leutenegger

Incrementally recovering 3D dense structures from monocular videos is of paramount importance since it enables various robotics and AR applications.

Paper
Add Code

Project Aria: A New Tool for Egocentric Multi-Modal AI Research

no code implementations • 24 Aug 2023 • Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira, Harry Lanaras, Henry Howard-Jenkins, Huixuan Tang, Hyo Jin Kim, Jaime Rivera, Ji Luo, Jing Dong, Julian Straub, Kevin Bailey, Kevin Eckenhoff, Lingni Ma, Luis Pesqueira, Mark Schwesinger, Maurizio Monge, Nan Yang, Nick Charron, Nikhil Raina, Omkar Parkhi, Peter Borschowa, Pierre Moulon, Prince Gupta, Raul Mur-Artal, Robbie Pennington, Sachin Kulkarni, Sagar Miglani, Santosh Gondi, Saransh Solanki, Sean Diener, Shangyi Cheng, Simon Green, Steve Saarinen, Suvam Patra, Tassos Mourikis, Thomas Whelan, Tripti Singh, Vasileios Balntas, Vijay Baiyya, Wilson Dreewes, Xiaqing Pan, Yang Lou, Yipu Zhao, Yusuf Mansour, Yuyang Zou, Zhaoyang Lv, Zijian Wang, Mingfei Yan, Carl Ren, Renzo De Nardi, Richard Newcombe

Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception.

Paper
Add Code

Large Search Model: Redefining Search Stack in the Era of LLMs

no code implementations • 23 Oct 2023 • Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

Modern search engines are built on a stack of different components, including query understanding, retrieval, multi-stage ranking, and question answering, among others.

Language Modelling Large Language Model +3

Paper
Add Code

Enhancing Traffic Object Detection in Variable Illumination with RGB-Event Fusion

no code implementations • 1 Nov 2023 • Zhanwen Liu, Nan Yang, Yang Wang, Yuke Li, Xiangmo Zhao, Fei-Yue Wang

To address this issue, we introduce bio-inspired event cameras and propose a novel Structure-aware Fusion Network (SFNet) that extracts sharp and complete object structures from the event stream to compensate for the lost information in images through cross-modality fusion, enabling the network to obtain illumination-robust representations for traffic object detection.

Object object-detection +2

Paper
Add Code

Time-Frequency Localization Characteristics of the Delay-Doppler Plane Orthogonal Pulse

no code implementations • 13 Nov 2023 • Akram Shafie, Jinhong Yuan, Nan Yang, Hai Lin

Furthermore, we determine the TFA for the recently proposed generalized design of the DDOP.

Paper
Add Code

Event-driven Real-time Retrieval in Web Search

no code implementations • 1 Dec 2023 • Nan Yang, Shusen Zhang, Yannan Zhang, Xiaoling Bai, Hualong Deng, Tianhua Zhou, Jin Ma

The Event information is then integrated with the query through a cross-attention mechanism, resulting in a time-context query representation.

Information Retrieval Retrieval

Paper
Add Code

Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework

no code implementations • 17 Mar 2024 • Kaiyan Chang, Kun Wang, Nan Yang, Ying Wang, Dantong Jin, Wenlong Zhu, Zhirong Chen, Cangyuan Li, Hao Yan, Yunhao Zhou, Zhuoliang Zhao, Yuan Cheng, Yudong Pan, Yiqi Liu, Mengdi Wang, Shengwen Liang, Yinhe Han, Huawei Li, Xiaowei Li

Our 13B model (ChipGPT-FT) has a pass rate improvement compared with GPT-3. 5 in Verilog generation and outperforms in EDA script (i. e., SiliconCompiler) generation with only 200 EDA script data.

Data Augmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.