Search Results for author: Yan Gao

Found 66 papers, 29 papers with code

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL

no code implementations • EMNLP 2020 • Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

In Natural Language Interfaces to Databases systems, the text-to-SQL technique allows users to query databases by using natural language questions.

Text-To-SQL

Translating Headers of Tabular Data: A Pilot Study of Schema Translation

1 code implementation • EMNLP 2021 • Kunrui Zhu, Yan Gao, Jiaqi Guo, Jian-Guang Lou

Experiments on our dataset demonstrate that CAST significantly outperforms state-of-the-art neural machine translation models.

Machine Translation Translation

360

Improving Relevance Quality in Product Search using High-Precision Query-Product Semantic Similarity

no code implementations • ECNLP (ACL) 2022 • Alireza Bagheri Garakani, Fan Yang, Wen-Yu Hua, Yetian Chen, Michinari Momma, Jingyuan Deng, Yan Gao, Yi Sun

Ensuring relevance quality in product search is a critical task as it impacts the customer’s ability to find intended products in the short term as well as the general perception and trust of the e-commerce system in the long term.

Re-Ranking Semantic Similarity +1

Spelling Correction using Phonetics in E-commerce Search

no code implementations • ECNLP (ACL) 2022 • Fan Yang, Alireza Bagheri Garakani, Yifei Teng, Yan Gao, Jia Liu, Jingyuan Deng, Yi Sun

In e-commerce search, spelling correction plays an important role in processing user-typed search queries and helping customers find the desired products.

Spelling Correction

From Image to Video, what do we need in multimodal LLMs?

no code implementations • 18 Apr 2024 • Suyuan Huang, Haoxin Zhang, Yan Gao, Yao Hu, Zengchang Qin

Multimodal Large Language Models (MLLMs) have demonstrated profound capabilities in understanding multimodal information, ranging from Image LLMs to the more complex Video LLMs.

Video Understanding

AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent Behavior

1 code implementation • 20 Mar 2024 • Zhouhong Gu, Xiaoxuan Zhu, Haoran Guo, Lin Zhang, Yin Cai, Hao Shen, Jiangjie Chen, Zheyu Ye, Yifei Dai, Yan Gao, Yao Hu, Hongwei Feng, Yanghua Xiao

Language significantly influences the formation and evolution of human emergent behavior, which is crucial in understanding collective intelligence within human societies.

Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents

no code implementations • 8 Mar 2024 • Jinyang Li, Nan Huo, Yan Gao, Jiayi Shi, Yingxiu Zhao, Ge Qu, Yurong Wu, Chenhao Ma, Jian-Guang Lou, Reynold Cheng

The challenges and costs of collecting realistic interactive logs for data analysis hinder the quantitative evaluation of Large Language Model (LLM) agents in this task.

Benchmarking Decision Making +2

NoteLLM: A Retrievable Large Language Model for Note Recommendation

no code implementations • 4 Mar 2024 • Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen

Indeed, learning to generate hashtags/categories can potentially enhance note embeddings, both of which compress key note information into limited content.

Contrastive Learning Language Modelling +1

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries

no code implementations • 20 Feb 2024 • Wei Zhao, Zhitao Hou, Siyuan Wu, Yan Gao, Haoyu Dong, Yao Wan, Hongyu Zhang, Yulei Sui, Haidong Zhang

Writing formulas on spreadsheets, such as Microsoft Excel and Google Sheets, is a widespread practice among users performing data analysis.

Natural Language Queries

FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients

no code implementations • 15 Feb 2024 • Xinchi Qiu, Yan Gao, Lorenzo Sani, Heng Pan, Wanru Zhao, Pedro P. B. Gusmao, Mina Alibeigi, Alex Iacob, Nicholas D. Lane

Federated learning (FL) is a distributed learning paradigm that facilitates collaborative training of a shared global model across devices while keeping data localized.

Federated Learning

Text2Analysis: A Benchmark of Table Question Answering with Advanced Data Analysis and Unclear Queries

no code implementations • 21 Dec 2023 • Xinyi He, Mengyu Zhou, Xinrun Xu, Xiaojun Ma, Rui Ding, Lun Du, Yan Gao, Ran Jia, Xu Chen, Shi Han, Zejian Yuan, Dongmei Zhang

We evaluate five state-of-the-art models using three different metrics, and the results show that our benchmark presents a considerable challenge in the field of tabular data analysis, paving the way for more advanced research opportunities.

Question Answering

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

no code implementations • 14 Dec 2023 • Yan Gao, Haojun Xu, Nannan Wang, Jie Li, Xinbo Gao

In addition to the previous method of treating objects as nodes, the network innovatively treats object trajectories as nodes for information interaction, improving the graph neural network's feature representation capability.

Multi-Object Tracking Multiple Object Tracking +1

Towards the Law of Capacity Gap in Distilling Language Models

1 code implementation • 13 Nov 2023 • Chen Zhang, Dawei Song, Zheyu Ye, Yan Gao

The pain is mainly caused by the curse of capacity gap, which describes that a larger teacher LM cannot always lead to a better student LM than one distilled from a smaller teacher LM, due to the effect of the capacity gap increment.

Language Modelling

Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments

no code implementations • 8 Nov 2023 • Tianchen Zhou, Jia Liu, Yang Jiao, Chaosheng Dong, Yetian Chen, Yan Gao, Yi Sun

Online learning to rank (ONL2R) is a foundational problem for recommender systems and has received increasing attention in recent years.

Learning-To-Rank Position +1

L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning

1 code implementation • ICCV 2023 • Yasar Abbas Ur Rehman, Yan Gao, Pedro Porto Buarque de Gusmão, Mina Alibeigi, Jiajun Shen, Nicholas D. Lane

The ubiquity of camera-enabled devices has led to large amounts of unlabeled image data being produced at the edge.

Federated Learning Representation Learning +1

Piecing Together Clues: A Benchmark for Evaluating the Detective Skills of Large Language Models

no code implementations • 11 Jul 2023 • Zhouhong Gu, Lin Zhang, Jiangjie Chen, Haoning Ye, Xiaoxuan Zhu, Zihan Li, Zheyu Ye, Yan Gao, Yao Hu, Yanghua Xiao, Hongwei Feng

We introduce DetectBench, a reading comprehension dataset designed to assess a model's joint ability in key information detection and multi-hop reasoning when facing complex and implicit information.

Common Sense Reasoning Decision Making +2

Improving the Transferability of Time Series Forecasting with Decomposition Adaptation

no code implementations • 30 Jun 2023 • Yan Gao, Yan Wang, Qiang Wang

However, in time series forecasting, it is difficult to obtain enough data, which limits the performance of neural forecasting models.

Multivariate Time Series Forecasting Time Series +1

AdaSelection: Accelerating Deep Learning Training through Data Subsampling

no code implementations • 19 Jun 2023 • Minghe Zhang, Chaosheng Dong, Jinmiao Fu, Tianchen Zhou, Jia Liang, Jia Liu, Bo Liu, Michinari Momma, Bryan Wang, Yan Gao, Yi Sun

In this paper, we introduce AdaSelection, an adaptive sub-sampling method to identify the most informative sub-samples within each minibatch to speed up the training of large-scale deep learning models without sacrificing model performance.

Secure Vertical Federated Learning Under Unreliable Connectivity

no code implementations • 26 May 2023 • Xinchi Qiu, Heng Pan, Wanru Zhao, Yan Gao, Pedro P. B. Gusmao, William F. Shen, Chenyang Ma, Nicholas D. Lane

Most work in privacy-preserving federated learning (FL) has focused on horizontally partitioned datasets where clients hold the same features and train complete client-level models independently.

Privacy Preserving Vertical Federated Learning

Uncovering and Categorizing Social Biases in Text-to-SQL

1 code implementation • 25 May 2023 • Yan Liu, Yan Gao, Zhe Su, Xiaokang Chen, Elliott Ash, Jian-Guang Lou

In this work, we aim to uncover and categorize social biases in Text-to-SQL models.

Text-To-SQL

TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering

no code implementations • 24 May 2023 • Jian Wu, Yicheng Xu, Yan Gao, Jian-Guang Lou, Börje F. Karlsson, Manabu Okumura

A common challenge in HQA and other passage-table QA datasets is that it is generally unrealistic to iterate over all table rows, columns, and linked passages to retrieve evidence.

Question Answering Retrieval

Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition

1 code implementation • 21 May 2023 • Haojun Xu, Yan Gao, Zheng Hui, Jie Li, Xinbo Gao

Also, humans have brain regions dedicated to understanding the minds of others and analyzing their intentions, such as the medial prefrontal cortex of the temporal lobe.

Ranked #2 on Skeleton Based Action Recognition on NTU RGB+D 120 (using extra training data)

Action Recognition GPR +2

MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation

no code implementations • 14 Apr 2023 • Jie Guo, Qimeng Wang, Yan Gao, XiaoLong Jiang, Xu Tang, Yao Hu, Baochang Zhang

CLIP (Contrastive Language-Image Pretraining) is well-developed for open-vocabulary zero-shot image-level recognition, while its applications in pixel-level tasks are less investigated, where most efforts directly adopt CLIP features without deliberative adaptations.

GPR Open Vocabulary Semantic Segmentation +3

Multi-view reconstruction of bullet time effect based on improved NSFF model

no code implementations • 1 Apr 2023 • Linquan Yu, Yan Gao, Yangtian Yan, Wentao Zeng

By using the optical flow prediction information to suppress the dynamic network in a timely manner, the network is forced to improve the reconstruction effect of dynamic and static networks independently, and the ability to understand and reconstruct dynamic and static scenes is improved.

Neural Rendering Optical Flow Estimation

Adaptive Approximate Implicitization of Planar Parametric Curves via Weak Gradient Constraints

no code implementations • 23 Feb 2023 • Minghao Guo, Yan Gao, Zheng Pan

Converting a parametric curve into the implicit form, which is called implicitization, has always been a popular but challenging problem in geometric modeling and related applications.

OvarNet: Towards Open-vocabulary Object Attribute Recognition

1 code implementation • CVPR 2023 • Keyan Chen, XiaoLong Jiang, Yao Hu, Xu Tang, Yan Gao, Jianqi Chen, Weidi Xie

In this paper, we consider the problem of simultaneously detecting objects and inferring their visual attributes in an image, even for those with no manual annotations provided at the training stage, resembling an open-vocabulary scenario.

Ranked #1 on Open Vocabulary Attribute Detection on OVAD benchmark (using extra training data)

Attribute Knowledge Distillation +5

Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge

1 code implementation • 3 Jan 2023 • Longxu Dou, Yan Gao, Xuqi Liu, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Min-Yen Kan, Jian-Guang Lou

In this paper, we study the problem of knowledge-intensive text-to-SQL, in which domain knowledge is necessary to parse expert questions into SQL queries over domain-specific tables.

Semantic Parsing Text-To-SQL

360

Learning Attribute and Class-Specific Representation Duet for Fine-Grained Fashion Analysis

no code implementations • CVPR 2023 • Yang Jiao, Yan Gao, Jingjing Meng, Jin Shang, Yi Sun

Fashion representation learning involves the analysis and understanding of various visual elements at different granularities and the interactions among them.

Attribute Inductive Bias +2

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing

1 code implementation • 27 Dec 2022 • Longxu Dou, Yan Gao, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Jian-Guang Lou

Text-to-SQL semantic parsing is an important NLP task, which greatly facilitates the interaction between users and the database and becomes the key component in many human-computer interaction systems.

Benchmarking Semantic Parsing +1

360

Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation

1 code implementation • ACL 2022 • Xinyu Pi, Bing Wang, Yan Gao, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou

The robustness of Text-to-SQL parsers against adversarial perturbations plays a crucial role in delivering highly reliable applications.

Text-To-SQL

360

Know What I don't Know: Handling Ambiguous and Unanswerable Questions for Text-to-SQL

1 code implementation • 17 Dec 2022 • Bing Wang, Yan Gao, Zhoujun Li, Jian-Guang Lou

Following this study, we propose a simple yet effective counterfactual example generation approach that automatically produces ambiguous and unanswerable text-to-SQL examples.

counterfactual Text-To-SQL

Federated Learning for Inference at Anytime and Anywhere

no code implementations • 8 Dec 2022 • Zicheng Liu, Da Li, Javier Fernandez-Marques, Stefanos Laskaridis, Yan Gao, Łukasz Dudziak, Stan Z. Li, Shell Xu Hu, Timothy Hospedales

Federated learning has been predominantly concerned with collaborative training of deep networks from scratch, and especially the many challenges that arise, such as communication cost, robustness to heterogeneous data, and support for diverse device capabilities.

Federated Learning

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

2 code implementations • 7 Nov 2022 • Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He

While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints.

Image Super-Resolution

105

Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio

no code implementations • 30 Sep 2022 • Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Pedro P. B. de Gusmao, Nicholas D. Lane

Self-supervised learning (SSL) has proven vital in speech and audio-related applications.

Model Compression Self-Supervised Learning

ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity

no code implementations • ICLR 2022 • Xinchi Qiu, Javier Fernandez-Marques, Pedro PB Gusmao, Yan Gao, Titouan Parcollet, Nicholas Donald Lane

When the available hardware cannot meet the memory and compute requirements to efficiently train high performing machine learning models, a compromise in either the training quality or the model complexity is needed.

Federated Learning

Federated Self-supervised Learning for Video Understanding

2 code implementations • 5 Jul 2022 • Yasar Abbas Ur Rehman, Yan Gao, Jiajun Shen, Pedro Porto Buarque de Gusmao, Nicholas Lane

The ubiquity of camera-enabled mobile devices has led to large amounts of unlabelled video data being produced at the edge.

Ranked #1 on Action Recognition on UCF-101 (Accuracy metric)

Action Recognition Federated Learning +3

4,176

LogiGAN: Learning Logical Reasoning via Adversarial Pre-training

1 code implementation • 18 May 2022 • Xinyu Pi, Wanjun Zhong, Yan Gao, Nan Duan, Jian-Guang Lou

We present LogiGAN, an unsupervised adversarial pre-training framework for improving logical reasoning abilities of language models.

Logical Reasoning Sentence

360

NFormer: Robust Person Re-identification with Neighbor Transformer

1 code implementation • CVPR 2022 • Haochen Wang, Jiayi Shen, Yongtuo Liu, Yan Gao, Efstratios Gavves

To tackle this issue, we propose a Neighbor Transformer Network, or NFormer, which explicitly models interactions across all input images, thus suppressing outlier features and leading to more robust representations overall.

Person Re-Identification Representation Learning

Federated Self-supervised Speech Representations: Are We There Yet?

no code implementations • 6 Apr 2022 • Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Abhinav Mehrotra, Nicholas D. Lane

The ubiquity of microphone-enabled devices has led to large amounts of unlabelled audio data being produced at the edge.

Federated Learning Self-Supervised Learning

UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL

1 code implementation • 15 Mar 2022 • Longxu Dou, Yan Gao, Mingyang Pan, Dingzirui Wang, Wanxiang Che, Dechen Zhan, Jian-Guang Lou

Existing text-to-SQL semantic parsers are typically designed for particular settings, such as handling queries that span multiple tables, domains, or turns, which makes them ineffective when applied to different settings.

Language Modelling Text-To-SQL

360

Decoupled IoU Regression for Object Detection

no code implementations • 2 Feb 2022 • Yan Gao, Qimeng Wang, Xu Tang, Haochen Wang, Fei Ding, Jing Li, Yao Hu

Prior works propose to predict Intersection-over-Union (IoU) between bounding boxes and corresponding ground-truths to improve NMS, while accurately predicting IoU is still a challenging problem.

Object object-detection +2

Reasoning Like Program Executors

1 code implementation • 27 Jan 2022 • Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen

Reasoning over natural language is a long-standing goal for the research community.

Ranked #2 on Question Answering on DROP Test (using extra training data)

Logical Reasoning Math +1

360

Physically Disentangled Intra- and Inter-Domain Adaptation for Varicolored Haze Removal

1 code implementation • CVPR 2022 • Yi Li, Yi Chang, Yan Gao, Changfeng Yu, Luxin Yan

Consequently, we perform inter-domain adaptation between the synthetic and real images by mutually exchanging the background and other two components.

Domain Adaptation Image Dehazing

Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge

no code implementations • 15 Nov 2021 • Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

To promote the development of occlusion understanding, we collect a large-scale dataset called OVIS for video instance segmentation in the occluded scenario.

Instance Segmentation Object Recognition +3

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation

1 code implementation • ACL 2022 • Zhoujun Cheng, Haoyu Dong, Zhiruo Wang, Ran Jia, Jiaqi Guo, Yan Gao, Shi Han, Jian-Guang Lou, Dongmei Zhang

HiTab provides 10,686 QA pairs and descriptive sentences with well-annotated quantity and entity alignment on 3,597 tables with broad coverage of table hierarchies and numerical reasoning types.

Descriptive Entity Alignment +2

SpeechBrain: A General-Purpose Speech Toolkit

4 code implementations • 8 Jun 2021 • Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato de Mori, Yoshua Bengio

SpeechBrain is an open-source and all-in-one speech toolkit.

Language Identification Spoken Language Understanding

7,879

End-to-End Speech Recognition from Federated Acoustic Models

1 code implementation • 29 Apr 2021 • Yan Gao, Titouan Parcollet, Salah Zaiem, Javier Fernandez-Marques, Pedro P. B. de Gusmao, Daniel J. Beutel, Nicholas D. Lane

Training Automatic Speech Recognition (ASR) models under federated learning (FL) settings has attracted a lot of attention recently.

2k 4k +4

On-device Federated Learning with Flower

no code implementations • 7 Apr 2021 • Akhil Mathur, Daniel J. Beutel, Pedro Porto Buarque de Gusmão, Javier Fernandez-Marques, Taner Topal, Xinchi Qiu, Titouan Parcollet, Yan Gao, Nicholas D. Lane

Federated Learning (FL) allows edge devices to collaboratively learn a shared prediction model while keeping their training data on the device, thereby decoupling the ability to do machine learning from the need to store data in the cloud.

BIG-bench Machine Learning Federated Learning

Hyperspectral Image Denoising Based On Multi-Stream Denoising Network

no code implementations • 6 Apr 2021 • Yan Gao, Feng Gao, Junyu Dong

Our network consists of the noise estimation subnetwork and denoising subnetwork.

Hyperspectral Image Denoising Image Denoising +1

A first look into the carbon footprint of federated learning

no code implementations • 15 Feb 2021 • Xinchi Qiu, Titouan Parcollet, Javier Fernandez-Marques, Pedro Porto Buarque de Gusmao, Yan Gao, Daniel J. Beutel, Taner Topal, Akhil Mathur, Nicholas D. Lane

Despite impressive results, deep learning-based technologies also raise severe privacy and environmental concerns induced by the training procedure often conducted in data centers.

Federated Learning

Occluded Video Instance Segmentation: A Benchmark

2 code implementations • 2 Feb 2021 • Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

On the OVIS dataset, the highest AP achieved by state-of-the-art algorithms is only 16.3, which reveals that we are still at a nascent stage for understanding objects, instances, and videos in a real-world scenario.

Ranked #39 on Video Instance Segmentation on OVIS validation

Instance Segmentation Segmentation +3

Dynamic-K Recommendation with Personalized Decision Boundary

no code implementations • 25 Dec 2020 • Yan Gao, Jiafeng Guo, Yanyan Lan, Huaming Liao

The ranking objective is the same as existing methods, i.e., to create a ranking list of items according to users' interests.

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL

1 code implementation • 9 Nov 2020 • Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

In Natural Language Interfaces to Databases systems, the text-to-SQL technique allows users to query databases by using natural language questions.

Text-To-SQL

360

Flower: A Friendly Federated Learning Research Framework

1 code implementation • 28 Jul 2020 • Daniel J. Beutel, Taner Topal, Akhil Mathur, Xinchi Qiu, Javier Fernandez-Marques, Yan Gao, Lorenzo Sani, Kwing Hei Li, Titouan Parcollet, Pedro Porto Buarque de Gusmão, Nicholas D. Lane

Federated Learning (FL) has emerged as a promising technique for edge devices to collaboratively learn a shared prediction model, while keeping their training data on the device, thereby decoupling the ability to do machine learning from the need to store the data in the cloud.

Federated Learning

4,176

Compositional Generalization by Learning Analytical Expressions

1 code implementation • NeurIPS 2020 • Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang

Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily.

Hierarchical Reinforcement Learning

360

IMUTube: Automatic Extraction of Virtual on-body Accelerometry from Video for Human Activity Recognition

no code implementations • 29 May 2020 • Hyeokhyen Kwon, Catherine Tong, Harish Haresamudram, Yan Gao, Gregory D. Abowd, Nicholas D. Lane, Thomas Ploetz

The lack of large-scale, labeled data sets impedes progress in developing robust and generalized predictive models for on-body sensor-based human activity recognition (HAR).

Human Activity Recognition

Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition

3 code implementations • 19 May 2020 • Yan Gao, Titouan Parcollet, Nicholas Lane

In the specific context of Automatic Speech Recognition (ASR), distillation from ensembles of acoustic models has recently shown promising results in increasing recognition performance.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

1st Place Solutions for OpenImage2019 -- Object Detection and Instance Segmentation

2 code implementations • 17 Mar 2020 • Yu Liu, Guanglu Song, Yuhang Zang, Yan Gao, Enze Xie, Junjie Yan, Chen Change Loy, Xiaogang Wang

Given such good instance bounding box, we further design a simple instance-level semantic segmentation pipeline and achieve the 1st place on the segmentation challenge.

General Classification Instance Segmentation +6

453

A Hybrid Semantic Parsing Approach for Tabular Data Analysis

no code implementations • 23 Oct 2019 • Yan Gao, Jian-Guang Lou, Dongmei Zhang

This paper presents a novel approach to translating natural language questions to SQL queries for given tables, which meets three requirements as a real-world data analysis application: cross-domain, multilingualism and enabling quick-start.

Semantic Parsing

C-MIDN: Coupled Multiple Instance Detection Network With Segmentation Guidance for Weakly Supervised Object Detection

no code implementations • ICCV 2019 • Yan Gao, Boxiao Liu, Nan Guo, Xiaochun Ye, Fang Wan, Haihang You, Dongrui Fan

Weakly supervised object detection (WSOD) that only needs image-level annotations has obtained much attention recently.

Ranked #4 on Weakly Supervised Object Detection on PASCAL VOC 2012 test

Multiple Instance Learning object-detection +1

Utilizing the Instability in Weakly Supervised Object Detection

no code implementations • 14 Jun 2019 • Yan Gao, Boxiao Liu, Nan Guo, Xiaochun Ye, Fang Wan, Haihang You, Dongrui Fan

Weakly supervised object detection (WSOD) focuses on training object detectors with only image-level annotations, and is challenging due to the gap between the supervision and the objective.

Ranked #8 on Weakly Supervised Object Detection on PASCAL VOC 2012 test

Multiple Instance Learning Object +2

Characterizing Shadow Price via Lagrangian Multiplier for Nonsmooth Problem

no code implementations • 31 May 2019 • Yan Gao

It is shown that the Lagrangian Multiplier is the upper bound of shadow price for convex optimization and a class of Lipschitzian optimizations.

Relation

Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation

5 code implementations • ACL 2019 • Jiaqi Guo, Zecheng Zhan, Yan Gao, Yan Xiao, Jian-Guang Lou, Ting Liu, Dongmei Zhang

We present a neural approach called IRNet for complex and cross-domain Text-to-SQL.

Text-To-SQL

256

Towards Reliable, Automated General Movement Assessment for Perinatal Stroke Screening in Infants Using Wearable Accelerometers

no code implementations • 21 Feb 2019 • Yan Gao, Yang Long, Yu Guan, Anna Basu, Jessica Baggaley, Thomas Ploetz

We demonstrate the effectiveness of our approach in a study with 34 newborns (21 typically developing infants and 13 PS infants with abnormal movements).

Robust Cross-View Gait Recognition with Evidence: A Discriminant Gait GAN (DiGGAN) Approach

1 code implementation • 26 Nov 2018 • BingZhang Hu, Yu Guan, Yan Gao, Yang Long, Nicholas Lane, Thomas Ploetz

Gait as a biometric trait has attracted much attention in many security and privacy applications such as identity recognition and authentication, during the last few decades.

Gait Identification Gait Recognition +1

The SAS Statistical Machine Translation System for WAT 2014

no code implementations • WS 2014 • Rui Wang, Xu Yang, Yan Gao

Machine Translation Translation
