Search Results for author: Tao Yu

Found 137 papers, 70 papers with code

RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera

no code implementations • ECCV 2020 • Zhuo Su, Lan Xu, Zerong Zheng, Tao Yu, Yebin Liu, Lu Fang

To enable robust tracking, we embrace both the initial model and the various visual cues into a novel performance capture scheme with hybrid motion optimization and semantic volumetric fusion, which can successfully capture challenging human motions under the monocular setting without pre-scanned detailed template and owns the reinitialization ability to recover from tracking failures and the disappear-reoccur scenarios.

4D reconstruction

Paper
Add Code

Effective Fine-Tuning Methods for Cross-lingual Adaptation

no code implementations • EMNLP 2021 • Tao Yu, Shafiq Joty

In this work, we propose a novel fine-tuning method based on co-training that aims to learn more generalized semantic equivalences as a complementary to multilingual language modeling using the unlabeled data in the target language.

Contrastive Learning Language Modelling +1

Paper
Add Code

Testing Cross-Database Semantic Parsers With Canonical Utterances

1 code implementation • EMNLP (Eval4NLP) 2021 • Heather Lent, Semih Yavuz, Tao Yu, Tong Niu, Yingbo Zhou, Dragomir Radev, Xi Victoria Lin

Paper
Code

An Exploratory Study on Long Dialogue Summarization: What Works and What’s Next

1 code implementation • Findings (EMNLP) 2021 • Yusen Zhang, Ansong Ni, Tao Yu, Rui Zhang, Chenguang Zhu, Budhaditya Deb, Asli Celikyilmaz, Ahmed Hassan Awadallah, Dragomir Radev

Dialogue summarization helps readers capture salient information from long conversations in meetings, interviews, and TV series.

Retrieval

Paper
Code

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

no code implementations • 11 Apr 2024 • Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu

Autonomous agents that accomplish complex computer tasks with minimal human interventions have the potential to transform human-computer interaction, significantly enhancing accessibility and productivity.

Benchmarking

Paper
Add Code

MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors

no code implementations • 26 Mar 2024 • He Zhang, Shenghao Ren, Haolei Yuan, Jianhui Zhao, Fan Li, Shuangpeng Sun, Zhenghao Liang, Tao Yu, Qiu Shen, Xun Cao

To validate the dataset, we propose an RGBD-P SMPL fitting method and also a monocular-video-based baseline framework, VP-MoCap, for human motion capture.

Translation

Paper
Add Code

Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience

no code implementations • 15 Mar 2024 • Xiaohang Yu, Zhengxian Yang, Shi Pan, Yuqi Han, Haoxiang Wang, Jun Zhang, Shi Yan, Borong Lin, Lei Yang, Tao Yu, Lu Fang

We have built a custom mobile multi-camera large-space dense light field capture system, which provides a series of high-quality and sufficiently dense light field images for various scenarios.

3D Reconstruction 3D Scene Reconstruction +1

Paper
Add Code

Yi: Open Foundation Models by 01.AI

1 code implementation • 7 Mar 2024 • 01. AI, :, Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, Jing Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie, Yuchi Xu, Yudong Liu, Yue Wang, Yuxuan Cai, Zhenyu Gu, Zhiyuan Liu, Zonghong Dai

The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models.

Attribute Chatbot +2

7,086

Paper
Code

YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5

no code implementations • 22 Feb 2024 • Peng Gao, Chun-Lin Ji, Tao Yu, Ru-Yue Yuan

Additionally, we have incorporated a global attention mechanism into the backbone network.

Object object-detection +1

Paper
Add Code

Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning

no code implementations • 22 Feb 2024 • Peng Gao, Tao Yu, Fei Wang, Ru-Yue Yuan

Designing distributed filtering circuits (DFCs) is complex and time-consuming, with the circuit performance relying heavily on the expertise and experience of electronics engineers.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept

no code implementations • 20 Feb 2024 • Kui Wang, Zongdian Li, Kazuma Nonomura, Tao Yu, Kei Sakaguchi, Omar Hashash, Walid Saad

The performance of SMDT is evaluated from two standpoints: (i) the rewards of the proposed navigation system on traffic efficiency and safety and, (ii) the latency and reliability of the SMDT platform.

Autonomous Driving Blocking

Paper
Add Code

ARKS: Active Retrieval in Knowledge Soup for Code Generation

no code implementations • 19 Feb 2024 • Hongjin Su, Shuyang Jiang, Yuhang Lai, Haoyuan Wu, Boao Shi, Che Liu, Qian Liu, Tao Yu

Recently the retrieval-augmented generation (RAG) paradigm has raised much attention for its potential in incorporating external knowledge into large language models (LLMs) without further training.

Code Generation Retrieval

Paper
Add Code

Generative Representational Instruction Tuning

2 code implementations • 15 Feb 2024 • Niklas Muennighoff, Hongjin Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh, Douwe Kiela

Notably, we find that GRIT matches training on only generative or embedding data, thus we can unify both at no performance loss.

Language Modelling Large Language Model +1

806

Paper
Code

Momentum Approximation in Asynchronous Private Federated Learning

no code implementations • 14 Feb 2024 • Tao Yu, Congzheng Song, Jianyu Wang, Mona Chitnis

Asynchronous protocols have been shown to improve the scalability of federated learning (FL) with a massive number of clients.

Federated Learning

Paper
Add Code

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

1 code implementation • 12 Feb 2024 • Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong

Autonomous interaction with the computer has been a longstanding challenge with great potential, and the recent proliferation of large language models (LLMs) has markedly accelerated progress in building digital agents.

962

Paper
Code

Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning

no code implementations • 19 Jan 2024 • Bairong Deng, Tao Yu, Zhenning Pan, Xuehan Zhang, Yufeng Wu, Qiaoyi Ding

To fill these gaps, a novel contextual meta graph reinforcement learning (Meta-GRL) for a highly generalized multi-stage optimal dispatch policy is proposed.

Decision Making reinforcement-learning

Paper
Add Code

Fluctuation-based Adaptive Structured Pruning for Large Language Models

1 code implementation • 19 Dec 2023 • Yongqi An, Xu Zhao, Tao Yu, Ming Tang, Jinqiao Wang

Retraining-free is important for LLMs' pruning methods.

Network Pruning

Paper
Code

DisControlFace: Disentangled Control for Personalized Facial Image Editing

no code implementations • 11 Dec 2023 • Haozhe Jia, Yan Li, Hengfei Cui, Di Xu, Changpeng Yang, Yuwang Wang, Tao Yu

Our DisControlNet can perform robust editing on any facial image through training on large-scale 2D in-the-wild portraits and also supports low-cost fine-tuning with few additional images to further learn diverse personalized priors of a specific person.

Paper
Add Code

Internet of Federated Digital Twins (IoFDT): Connecting Twins Beyond Borders for Society 5.0

no code implementations • 11 Dec 2023 • Tao Yu, Zongdian Li, Kei Sakaguchi, Omar Hashash, Walid Saad, Merouane Debbah

In contrast, this paper envisions a novel concept of an Internet of Federated Digital Twins (IoFDT) that holistically integrates heterogeneous and physically separated DTs representing different Society 5. 0 services within a single framework and system.

Paper
Add Code

OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning

1 code implementation • 20 Nov 2023 • Haiyang Ying, Yixuan Yin, Jinzhi Zhang, Fan Wang, Tao Yu, Ruqi Huang, Lu Fang

Towards holistic understanding of 3D scenes, a general 3D segmentation method is needed that can segment diverse objects without restrictions on object quantity or categories, while also reflecting the inherent hierarchical structure.

Contrastive Learning Novel View Synthesis +1

Paper
Code

OpenAgents: An Open Platform for Language Agents in the Wild

2 code implementations • 16 Oct 2023 • Tianbao Xie, Fan Zhou, Zhoujun Cheng, Peng Shi, Luoxuan Weng, Yitao Liu, Toh Jing Hua, Junning Zhao, Qian Liu, Che Liu, Leo Z. Liu, Yiheng Xu, Hongjin Su, Dongchan Shin, Caiming Xiong, Tao Yu

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs).

2D Object Detection

3,378

Paper
Code

Lemur: Harmonizing Natural Language and Code for Language Agents

1 code implementation • 10 Oct 2023 • Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu

We introduce Lemur and Lemur-Chat, openly accessible language models optimized for both natural language and coding capabilities to serve as the backbone of versatile language agents.

516

Paper
Code

PARF: Primitive-Aware Radiance Fusion for Indoor Scene Novel View Synthesis

no code implementations • ICCV 2023 • Haiyang Ying, Baowei Jiang, Jinzhi Zhang, Di Xu, Tao Yu, Qionghai Dai, Lu Fang

This paper proposes a method for fast scene radiance field reconstruction with strong novel view synthesis performance and convenient scene editing functionality.

Novel View Synthesis Semantic Parsing

Paper
Add Code

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

1 code implementation • 20 Sep 2023 • Tianbao Xie, Siheng Zhao, Chen Henry Wu, Yitao Liu, Qian Luo, Victor Zhong, Yanchao Yang, Tao Yu

Unlike inverse RL and recent work that uses LLMs to write sparse reward codes, Text2Reward produces interpretable, free-form dense reward codes that cover a wide range of tasks, utilize existing packages, and allow iterative refinement with human feedback.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Dynamic Spectrum Mixer for Visual Recognition

no code implementations • 13 Sep 2023 • Zhiqiang Hu, Tao Yu

Recently, MLP-based vision backbones have achieved promising performance in several visual recognition tasks.

Image Classification object-detection +2

Paper
Add Code

ImmersiveNeRF: Hybrid Radiance Fields for Unbounded Immersive Light Field Reconstruction

no code implementations • 4 Sep 2023 • Xiaohang Yu, Haoxiang Wang, Yuqi Han, Lei Yang, Tao Yu, Qionghai Dai

This paper proposes a hybrid radiance field representation for unbounded immersive light field reconstruction which supports high-quality rendering and aggressive view extrapolation.

Segmentation

Paper
Add Code

AutoDroid: LLM-powered Task Automation in Android

no code implementations • 29 Aug 2023 • Hao Wen, Yuanchun Li, Guohong Liu, Shanhui Zhao, Tao Yu, Toby Jia-Jun Li, Shiqi Jiang, Yunhao Liu, Yaqin Zhang, Yunxin Liu

Mobile task automation is an attractive technique that aims to enable voice-based hands-free user interaction with smartphones.

Language Modelling

Paper
Add Code

Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

no code implementations • 13 Jul 2023 • Bo-Ru Lu, Nikita Haduong, Chia-Hsuan Lee, Zeqiu Wu, Hao Cheng, Paul Koester, Jean Utke, Tao Yu, Noah A. Smith, Mari Ostendorf

The capabilities of pretrained language models have opened opportunities to explore new application areas, but applications involving human-human interaction are limited by the fact that most data is protected from public release for privacy reasons.

Dialogue Generation Dialogue State Tracking +1

Paper
Add Code

Fast Segment Anything

1 code implementation • 21 Jun 2023 • Xu Zhao, Wenchao Ding, Yongqi An, Yinglong Du, Tao Yu, Min Li, Ming Tang, Jinqiao Wang

In this paper, we propose a speed-up alternative method for this fundamental task with comparable performance.

Ranked #4 on Zero-Shot Instance Segmentation on LVIS v1.0 val

Edge Detection Image Segmentation +6

6,803

Paper
Code

BrainNet: Epileptic Wave Detection from SEEG with Hierarchical Graph Diffusion Learning

no code implementations • 15 Jun 2023 • Junru Chen, Yang Yang, Tao Yu, Yingying Fan, Xiaolong Mo, Carl Yang

Therefore, we propose the first data-driven study to detect epileptic waves in a real-world SEEG dataset.

EEG Self-Supervised Learning +1

Paper
Add Code

Coneheads: Hierarchy Aware Attention

1 code implementation • NeurIPS 2023 • Albert Tseng, Tao Yu, Toni J. B. Liu, Christopher De Sa

These networks rely heavily on the dot product attention operator, which computes the similarity between two points by taking their inner product.

Paper
Code

Shadow Cones: A Generalized Framework for Partial Order Embeddings

1 code implementation • 24 May 2023 • Tao Yu, Toni J. B. Liu, Albert Tseng, Christopher De Sa

Specifically, we model partial orders as subset relations between shadows formed by a light source and opaque objects in hyperbolic space.

Paper
Code

Generating Data for Symbolic Language with Large Language Models

1 code implementation • 23 May 2023 • Jiacheng Ye, Chengzu Li, Lingpeng Kong, Tao Yu

However, such an approach has primarily been applied to natural language tasks and has not yet been explored for symbolic language tasks with complex structured outputs (e. g., semantic parsing and code generation).

Code Generation Semantic Parsing

Paper
Code

Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression

no code implementations • 4 May 2023 • Hao Zhang, Meng Yu, Yuzhong Wu, Tao Yu, Dong Yu

During offline training, a pre-processed signal obtained from the Kalman filter and an ideal microphone signal generated via teacher-forced training strategy are used to train the deep neural network (DNN).

Paper
Add Code

StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

1 code implementation • 1 May 2023 • Lizhen Wang, Xiaochen Zhao, Jingxiang Sun, Yuxiang Zhang, Hongwen Zhang, Tao Yu, Yebin Liu

Results and experiments demonstrate the superiority of our method in terms of image quality, full portrait video generation, and real-time re-animation compared to existing facial reenactment methods.

Face Reenactment Translation +1

351

Paper
Code

Super-NeRF: View-consistent Detail Generation for NeRF super-resolution

no code implementations • 26 Apr 2023 • Yuqi Han, Tao Yu, Xiaohang Yu, Yuwang Wang, Qionghai Dai

Given multi-view low-resolution images, Super-NeRF constructs a consistency-controlling super-resolution module to generate view-consistent high-resolution details for NeRF.

Image Super-Resolution

Paper
Add Code

Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting

no code implementations • CVPR 2023 • Ruichen Zheng, Peng Li, Haoqian Wang, Tao Yu

Detailed 3D reconstruction and photo-realistic relighting of digital humans are essential for various applications.

3D Human Reconstruction 3D Reconstruction

Paper
Add Code

The Seven Worlds and Experiences of the Wireless Metaverse: Challenges and Opportunities

no code implementations • 20 Apr 2023 • Omar Hashash, Christina Chaccour, Walid Saad, Tao Yu, Kei Sakaguchi, Merouane Debbah

We then articulate how these experiences bring forth interactions between diverse metaverse constituents, namely, a) humans and avatars and b) connected intelligence systems and their digital twins (DTs).

Paper
Add Code

Inpaint Anything: Segment Anything Meets Image Inpainting

1 code implementation • 13 Apr 2023 • Tao Yu, Runseng Feng, Ruoyu Feng, Jinming Liu, Xin Jin, Wenjun Zeng, Zhibo Chen

We are also very willing to help everyone share and promote new projects based on our Inpaint Anything (IA).

Image Inpainting

5,435

Paper
Code

Hi Sheldon! Creating Deep Personalized Characters from TV Shows

no code implementations • 9 Apr 2023 • Meidai Xuanyuan, Yuwang Wang, Honglei Guo, Xiao Ma, Yuchen Guo, Tao Yu, Qionghai Dai

To support this novel task, we further collect a character centric multimodal dialogue dataset, named Deep Personalized Character Dataset (DPCD), from TV shows.

Paper
Add Code

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection

1 code implementation • CVPR 2023 • Yongqi An, Xu Zhao, Tao Yu, Haiyun Guo, Chaoyang Zhao, Ming Tang, Jinqiao Wang

However, previous unsupervised deep learning BGS algorithms perform poorly in sophisticated scenarios such as shadows or night lights, and they cannot detect objects outside the pre-defined categories.

Foreground Segmentation Object +2

Paper
Code

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

no code implementations • 14 Mar 2023 • Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang

This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge.

Speech Enhancement

Paper
Add Code

Automated Self-Supervised Learning for Recommendation

2 code implementations • 14 Mar 2023 • Lianghao Xia, Chao Huang, Chunzhen Huang, Kangyi Lin, Tao Yu, Ben Kao

This does not generalize across different datasets and downstream recommendation tasks, which is difficult to be adaptive for data augmentation and robust to noise perturbation.

Collaborative Filtering Contrastive Learning +2

282

Paper
Code

Compositional Exemplars for In-context Learning

1 code implementation • 11 Feb 2023 • Jiacheng Ye, Zhiyong Wu, Jiangtao Feng, Tao Yu, Lingpeng Kong

The performance of ICL is highly dominated by the quality of the selected in-context examples.

Code Generation Contrastive Learning +6

Paper
Code

Batch Prompting: Efficient Inference with Large Language Model APIs

2 code implementations • 19 Jan 2023 • Zhoujun Cheng, Jungo Kasai, Tao Yu

We extensively validate the effectiveness of batch prompting on ten datasets across commonsense QA, arithmetic reasoning, and NLI/NLU: batch prompting significantly~(up to 5x with six samples in batch) reduces the LLM (Codex) inference token and time costs while achieving better or comparable performance.

Arithmetic Reasoning In-Context Learning +2

Paper
Code

One Embedder, Any Task: Instruction-Finetuned Text Embeddings

3 code implementations • 19 Dec 2022 • Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu

Our analysis suggests that INSTRUCTOR is robust to changes in instructions, and that instruction finetuning mitigates the challenge of training a single model on diverse datasets.

Information Retrieval Learning Word Embeddings +3

4,003

Paper
Code

Realization Scheme for Visual Cryptography with Computer-generated Holograms

no code implementations • 10 Dec 2022 • Tao Yu, Jinge Ma, Guilin Li, Dongyu Yang, Rui Ma, Yishi Shi

This method can expand the application range of visual cryptography and further increase the security of visual cryptography.

Paper
Add Code

Coder Reviewer Reranking for Code Generation

1 code implementation • 29 Nov 2022 • Tianyi Zhang, Tao Yu, Tatsunori B. Hashimoto, Mike Lewis, Wen-tau Yih, Daniel Fried, Sida I. Wang

Sampling diverse programs from a code language model and reranking with model likelihood is a popular method for code generation but it is prone to preferring degenerate solutions.

Ranked #22 on Code Generation on MBPP

Code Generation Language Modelling

Paper
Code

Task Residual for Tuning Vision-Language Models

1 code implementation • CVPR 2023 • Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang

Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned general visual representations and broad visual concepts.

Transfer Learning

Paper
Code

DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation

1 code implementation • 18 Nov 2022 • Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Scott Wen-tau Yih, Daniel Fried, Sida Wang, Tao Yu

We introduce DS-1000, a code generation benchmark with a thousand data science problems spanning seven Python libraries, such as NumPy and Pandas.

Code Generation Memorization

185

Paper
Code

ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback

2 code implementations • 22 Oct 2022 • Jiacheng Ye, Jiahui Gao, Jiangtao Feng, Zhiyong Wu, Tao Yu, Lingpeng Kong

To improve the quality of dataset synthesis, we propose a progressive zero-shot dataset generation framework, ProGen, which leverages the feedback from the task-specific model to guide the generation of new training data via in-context examples.

Informativeness text-classification +2

Paper
Code

Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play

1 code implementation • 21 Oct 2022 • Qi Liu, Zihuiwen Ye, Tao Yu, Phil Blunsom, Linfeng Song

We first design a SQL-to-text model conditioned on a sampled goal query, which represents a user's intent, that then converses with a text-to-SQL semantic parser to generate new interactions.

Domain Generalization SQL-to-Text +1

Paper
Code

Binding Language Models in Symbolic Languages

1 code implementation • 6 Oct 2022 • Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

We propose Binder, a training-free neural-symbolic framework that maps the task input to a program, which (1) allows binding a unified API of language model (LM) functionalities to a programming language (e. g., SQL, Python) to extend its grammar coverage and thus tackle more diverse questions, (2) adopts an LM as both the program parser and the underlying model called by the API during execution, and (3) requires only a few in-context exemplar annotations.

Ranked #4 on Table-based Fact Verification on TabFact

Language Modelling Semantic Parsing +1

275

Paper
Code

NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries

no code implementations • 19 Sep 2022 • Yiru Chen, Ryan Li, Austin Mac, Tianbao Xie, Tao Yu, Eugene Wu

We develop NL2INTERFACE to explore the potential of generating usable interactive multi-visualization interfaces from natural language queries.

Natural Language Queries

Paper
Add Code

Selective Annotation Makes Language Models Better Few-Shot Learners

1 code implementation • 5 Sep 2022 • Hongjin Su, Jungo Kasai, Chen Henry Wu, Weijia Shi, Tianlu Wang, Jiayi Xin, Rui Zhang, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

Departing from recent in-context learning methods, we formulate an annotation-efficient, two-step framework: selective annotation that chooses a pool of examples to annotate from unlabeled data in advance, followed by prompt retrieval that retrieves task examples from the annotated pool at test time.

Code Generation In-Context Learning +1

Paper
Code

FOLIO: Natural Language Reasoning with First-Order Logic

1 code implementation • 2 Sep 2022 • Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Luke Benson, Lucy Sun, Ekaterina Zubova, Yujie Qiao, Matthew Burtell, David Peng, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Shafiq Joty, Alexander R. Fabbri, Wojciech Kryscinski, Xi Victoria Lin, Caiming Xiong, Dragomir Radev

We present FOLIO, a human-annotated, open-domain, and logically complex and diverse dataset for reasoning in natural language (NL), equipped with first order logic (FOL) annotations.

Language Modelling Large Language Model +1

Paper
Code

MCTensor: A High-Precision Deep Learning Library with Multi-Component Floating-Point

1 code implementation • 18 Jul 2022 • Tao Yu, Wentao Guo, Jianan Canal Li, Tiancheng Yuan, Christopher De Sa

In this paper, we introduce MCTensor, a library based on PyTorch for providing general-purpose and high-precision arithmetic for DL training.

Paper
Code

Geometry-aware Single-image Full-body Human Relighting

no code implementations • 11 Jul 2022 • Chaonan Ji, Tao Yu, Kaiwen Guo, Jingxin Liu, Yebin Liu

For the relighting, we introduce a ray tracing-based per-pixel lighting representation that explicitly models high-frequency shadows and propose a learning-based shading refinement module to restore realistic shadows (including hard cast shadows) from the ray-traced shading maps.

Disentanglement Neural Rendering

Paper
Add Code

Design and Analysis of Robust Resilient Diffusion over Multi-Task Networks Against Byzantine Attacks

no code implementations • 25 Jun 2022 • Tao Yu, Rodrigo C. de Lamare, Yi Yu

This paper studies distributed diffusion adaptation over clustered multi-task networks in the presence of impulsive interferences and Byzantine attacks.

Paper
Add Code

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

3 code implementations • 9 Jun 2022 • Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew Dai, Andrew La, Andrew Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartłomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, César Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodola, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, Francois Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocoń, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, Jose Hernandez-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras-Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Şenel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, Maria Jose Ramírez Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Orduna Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael Ivanitskiy, Michael Starritt, Michael Strube, Michał Swędrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T, Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha S. Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Miłkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima, Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, ZiRui Wang, Ziyi Wu

BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models.

Common Sense Reasoning Math +1

2,642

Paper
Code

SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment

1 code implementation • 9 May 2022 • Jianzhao Liu, Xin Li, Yanding Peng, Tao Yu, Zhibo Chen

In this paper, we design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space.

Ranked #1 on Compressed Image Quality Assessment on CLIC2021Test-subset

Compressed Image Quality Assessment Image Compression +1

Paper
Code

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling

no code implementations • 28 Apr 2022 • Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

4D human sensing and modeling are fundamental tasks in vision and graphics with numerous applications.

Fine-grained Action Recognition Pose Estimation

Paper
Add Code

GIMO: Gaze-Informed Human Motion Prediction in Context

1 code implementation • 20 Apr 2022 • Yang Zheng, Yanchao Yang, Kaichun Mo, Jiaman Li, Tao Yu, Yebin Liu, C. Karen Liu, Leonidas J. Guibas

We perform an extensive study of the benefits of leveraging the eye gaze for ego-centric human motion prediction with various state-of-the-art architectures.

Human motion prediction motion prediction

Paper
Code

ProbNVS: Fast Novel View Synthesis with Learned Probability-Guided Sampling

no code implementations • 7 Apr 2022 • Yuemei Zhou, Tao Yu, Zerong Zheng, Ying Fu, Yebin Liu

Existing state-of-the-art novel view synthesis methods rely on either fairly accurate 3D geometry estimation or sampling of the entire space for neural volumetric rendering, which limit the overall efficiency.

Novel View Synthesis

Paper
Add Code

Structured Local Radiance Fields for Human Avatar Modeling

no code implementations • CVPR 2022 • Zerong Zheng, Han Huang, Tao Yu, Hongwen Zhang, Yandong Guo, Yebin Liu

These local radiance fields not only leverage the flexibility of implicit representation in shape and appearance modeling, but also factorize cloth deformations into skeleton motions, node residual translations and the dynamic detail variations inside each individual radiance field.

Paper
Add Code

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset

1 code implementation • CVPR 2022 • Lizhen Wang, ZhiYuan Chen, Tao Yu, Chenguang Ma, Liang Li, Yebin Liu

In the coarse module, we generate a base parametric model from large-scale RGB-D images, which is able to predict accurate rough 3D face models in different genders, ages, etc.

2k 3D Face Reconstruction +1

440

Paper
Code

Interacting Attention Graph for Single Image Two-Hand Reconstruction

1 code implementation • CVPR 2022 • Mengcheng Li, Liang An, Hongwen Zhang, Lianpeng Wu, Feng Chen, Tao Yu, Yebin Liu

To solve occlusion and interaction challenges of two-hand reconstruction, we introduce two novel attention based modules in each upsampling step of the original GCN.

Ranked #4 on 3D Interacting Hand Pose Estimation on InterHand2.6M

3D Interacting Hand Pose Estimation Vocal Bursts Valence Prediction

258

Paper
Code

In-Context Learning for Few-Shot Dialogue State Tracking

1 code implementation • 16 Mar 2022 • Yushi Hu, Chia-Hsuan Lee, Tianbao Xie, Tao Yu, Noah A. Smith, Mari Ostendorf

In this work, we propose an in-context learning (ICL) framework for zero-shot and few-shot learning DST, where a large pre-trained language model (LM) takes a test instance and a few exemplars as input, and directly decodes the dialogue state without any parameter updates.

Dialogue State Tracking Few-Shot Learning +3

Paper
Code

OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation

no code implementations • 4 Mar 2022 • Peng Li, Jiayin Zhao, Jingyao Wu, Chao Deng, Haoqian Wang, Tao Yu

Light field disparity estimation is an essential task in computer vision with various applications.

Disparity Estimation

Paper
Add Code

ZeroGen: Efficient Zero-shot Learning via Dataset Generation

3 code implementations • 16 Feb 2022 • Jiacheng Ye, Jiahui Gao, Qintong Li, Hang Xu, Jiangtao Feng, Zhiyong Wu, Tao Yu, Lingpeng Kong

There is a growing interest in dataset generation recently due to the superior generative capacity of large pre-trained language models (PLMs).

Knowledge Distillation Natural Language Inference +5

Paper
Code

Random Laplacian Features for Learning with Hyperbolic Space

1 code implementation • 14 Feb 2022 • Tao Yu, Christopher De Sa

Due to its geometric properties, hyperbolic space can support high-fidelity embeddings of tree- and graph-structured data, upon which various hyperbolic networks have been developed.

Graph Learning Node Classification +2

Paper
Code

Understanding Hyperdimensional Computing for Parallel Single-Pass Learning

1 code implementation • 10 Feb 2022 • Tao Yu, Yichi Zhang, Zhiru Zhang, Christopher De Sa

Using representation theory, we characterize which similarity matrices can be "expressed" by finite group VSA hypervectors, and we show how these VSAs can be constructed.

Paper
Code

Mask-based Latent Reconstruction for Reinforcement Learning

1 code implementation • 28 Jan 2022 • Tao Yu, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen

For deep reinforcement learning (RL) from pixels, learning effective state representations is crucial for achieving high performance.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

HDhuman: High-quality Human Novel-view Rendering from Sparse Views

no code implementations • 20 Jan 2022 • Tiansong Zhou, Jing Huang, Tao Yu, Ruizhi Shao, Kun Li

To this end, we propose HDhuman, which uses a human reconstruction network with a pixel-aligned spatial transformer and a rendering network with geometry-guided pixel-wise feature integration to achieve high-quality human reconstruction and rendering.

2k Neural Rendering +2

Paper
Add Code

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

1 code implementation • 16 Jan 2022 • Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu

Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases.

Ranked #1 on Task-Oriented Dialogue Systems on KVRET

Few-Shot Learning Question Answering +3

530

Paper
Code

HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars

no code implementations • 19 Dec 2021 • Tao Hu, Tao Yu, Zerong Zheng, He Zhang, Yebin Liu, Matthias Zwicker

To handle complicated motions (e. g., self-occlusions), we then leverage the encoded information on the UV manifold to construct a 3D volumetric representation based on a dynamic pose-conditioned neural radiance field.

Neural Rendering

Paper
Add Code

Representing Hyperbolic Space Accurately using Multi-Component Floats

no code implementations • NeurIPS 2021 • Tao Yu, Christopher M. De Sa

Hyperbolic space is particularly useful for embedding data with hierarchical structure; however, representing hyperbolic space with ordinary floating-point numbers greatly affects the performance due to its \emph{ineluctable} numerical errors.

Paper
Add Code

S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

no code implementations • 16 Nov 2021 • Shubo Lv, Yihui Fu, Mengtao Xing, Jiayao Sun, Lei Xie, Jun Huang, Yannan Wang, Tao Yu

In speech enhancement, complex neural network has shown promising performance due to their effectiveness in processing complex-valued spectrum.

16k Denoising +2

Paper
Add Code

DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization

1 code implementation • ACL 2022 • Ziming Mao, Chen Henry Wu, Ansong Ni, Yusen Zhang, Rui Zhang, Tao Yu, Budhaditya Deb, Chenguang Zhu, Ahmed H. Awadallah, Dragomir Radev

Transformer-based models have achieved state-of-the-art performance on short-input summarization.

Abstractive Text Summarization

Paper
Code

Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions

no code implementations • 15 Sep 2021 • Naihao Deng, Shuaichen Chang, Peng Shi, Tao Yu, Rui Zhang

Existing text-to-SQL research only considers complete questions as the input, but lay-users might strive to formulate a complete question.

Text-To-SQL

Paper
Add Code

An Exploratory Study on Long Dialogue Summarization: What Works and What's Next

1 code implementation • 10 Sep 2021 • Yusen Zhang, Ansong Ni, Tao Yu, Rui Zhang, Chenguang Zhu, Budhaditya Deb, Asli Celikyilmaz, Ahmed Hassan Awadallah, Dragomir Radev

Dialogue summarization helps readers capture salient information from long conversations in meetings, interviews, and TV series.

Retrieval

Paper
Code

SummerTime: Text Summarization Toolkit for Non-experts

1 code implementation • EMNLP (ACL) 2021 • Ansong Ni, Zhangir Azerbayev, Mutethia Mutuma, Troy Feng, Yusen Zhang, Tao Yu, Ahmed Hassan Awadallah, Dragomir Radev

We also provide explanations for models and evaluation metrics to help users understand the model behaviors and select models that best suit their needs.

Document Summarization Multi-Document Summarization

260

Paper
Code

Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras

no code implementations • ICCV 2021 • Yuxiang Zhang, Zhe Li, Liang An, Mengcheng Li, Tao Yu, Yebin Liu

Overall, we propose the first light-weight total capture system and achieves fast, robust and accurate multi-person total motion capture performance.

Ranked #2 on 3D Multi-Person Pose Estimation on Shelf

3D Multi-Person Pose Estimation

Paper
Add Code

Logic-Consistency Text Generation from Semantic Parses

1 code implementation • Findings (ACL) 2021 • Chang Shu, Yusen Zhang, Xiangyu Dong, Peng Shi, Tao Yu, Rui Zhang

Text generation from semantic parses is to generate textual descriptions for formal representation inputs such as logic forms and SQL queries.

Text Generation

Paper
Code

End-to-End Cross-Domain Text-to-SQL Semantic Parsing with Auxiliary Task

no code implementations • 17 Jun 2021 • Peng Shi, Tao Yu, Patrick Ng, Zhiguo Wang

Furthermore, we propose two value filling methods to build the bridge from the existing zero-shot semantic parsers to real-world applications, considering most of the existing parsers ignore the values filling in the synthesized SQL.

Semantic Parsing Text-To-SQL

Paper
Add Code

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning

2 code implementations • NeurIPS 2021 • Tao Yu, Cuiling Lan, Wenjun Zeng, Mingxiao Feng, Zhizheng Zhang, Zhibo Chen

In this work, we propose a novel method, dubbed PlayVirtual, which augments cycle-consistent virtual trajectories to enhance the data efficiency for RL feature representation learning.

Ranked #1 on Continuous Control (100k environment steps) on DeepMind Finger Spin (Images)

Continuous Control (100k environment steps) Continuous Control (500k environment steps) +3

Paper
Code

DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Reconstruction and Rendering

no code implementations • CVPR 2022 • Ruizhi Shao, Hongwen Zhang, He Zhang, Mingjia Chen, YanPei Cao, Tao Yu, Yebin Liu

We introduce DoubleField, a novel framework combining the merits of both surface field and radiance field for high-fidelity human reconstruction and rendering.

Transfer Learning

Paper
Add Code

Function4D: Real-time Human Volumetric Capture from Very Sparse Consumer RGBD Sensors

no code implementations • CVPR 2021 • Tao Yu, Zerong Zheng, Kaiwen Guo, Pengpeng Liu, Qionghai Dai, Yebin Liu

Human volumetric capture is a long-standing topic in computer vision and computer graphics.

Paper
Add Code

DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras

no code implementations • ICCV 2021 • Yang Zheng, Ruizhi Shao, Yuxiang Zhang, Tao Yu, Zerong Zheng, Qionghai Dai, Yebin Liu

We propose DeepMultiCap, a novel method for multi-person performance capture using sparse multi-view cameras.

Paper
Add Code

QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization

1 code implementation • NAACL 2021 • Ming Zhong, Da Yin, Tao Yu, Ahmad Zaidi, Mutethia Mutuma, Rahul Jha, Ahmed Hassan Awadallah, Asli Celikyilmaz, Yang Liu, Xipeng Qiu, Dragomir Radev

As increasing numbers of meetings are recorded and transcribed, meeting summaries have become essential to remind those who may or may not have attended the meetings about the key decisions made and the tasks to be completed.

Meeting Summarization

100

Paper
Code

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

1 code implementation • 2 Apr 2021 • Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang

The ConferencingSpeech 2021 challenge is proposed to stimulate research on far-field multi-channel speech enhancement for video conferencing.

Speech Enhancement Task 2

Paper
Code

POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture

no code implementations • CVPR 2021 • Zhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu

By contributing a novel reconstruction framework which contains pose-guided keyframe selection and robust implicit surface fusion, our method fully utilizes the advantages of both tracking-based methods and tracking-free inference methods, and finally enables the high-fidelity reconstruction of dynamic surface details even in the invisible regions.

3D Reconstruction

Paper
Add Code

Local Patch AutoAugment with Multi-Agent Collaboration

2 code implementations • 20 Mar 2021 • Shiqi Lin, Tao Yu, Ruoyu Feng, Xin Li, Xin Jin, Zhibo Chen

We formulate it as a multi-agent reinforcement learning (MARL) problem, where each agent learns an augmentation policy for each patch based on its content together with the semantics of the whole image.

Data Augmentation Fine-Grained Image Recognition +2

9,344

Paper
Code

Nematicity Arising from a Chiral Superconducting Ground State in Magic-Angle Twisted Bilayer Graphene under In-Plane Magnetic Fields

no code implementations • 5 Jan 2021 • Tao Yu, Dante M. Kennes, Angel Rubio, Michael A. Sentef

Recent measurements of the resistivity in magic-angle twisted bilayer graphene near the superconducting transition temperature show two-fold anisotropy, or nematicity, when changing the direction of an in-plane magnetic field [Cao \textit{et al.}, Science \textbf{372}, 264 (2021)].

Superconductivity

Paper
Add Code

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing

no code implementations • NeurIPS Workshop CAP 2020 • Tao Yu, Rui Zhang, Alex Polozov, Christopher Meek, Ahmed Hassan Awadallah

Conversational Semantic Parsing (CSP) is the task of converting a sequence of natural language queries to formal language (e. g., SQL, SPARQL) that can be executed against a structured ontology (e. g. databases, knowledge bases).

Ranked #3 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.1 (using extra training data)

Dialogue State Tracking Language Modelling +4

Paper
Add Code

Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

no code implementations • 11 Dec 2020 • Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i. e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations.

Image Super-Resolution

Paper
Add Code

Spin-Wave Doppler Shift by Magnon Drag in Magnetic Insulators

no code implementations • 30 Nov 2020 • Tao Yu, Chen Wang, Michael A. Sentef, Gerrit E. W. Bauer

The Doppler shift of the quasiparticle dispersion by charge currents is responsible for the critical supercurrents in superconductors and instabilities of the magnetic ground state of metallic ferromagnets.

Mesoscale and Nanoscale Physics

Paper
Add Code

Deep Implicit Templates for 3D Shape Representation

1 code implementation • CVPR 2021 • Zerong Zheng, Tao Yu, Qionghai Dai, Yebin Liu

Deep implicit functions (DIFs), as a kind of 3D shape representation, are becoming more and more popular in the 3D vision community due to their compactness and strong representation power.

3D Shape Representation

157

Paper
Code

DeepCloth: Neural Garment Representation for Shape and Style Editing

no code implementations • 30 Nov 2020 • Zhaoqi Su, Tao Yu, Yangang Wang, Yebin Liu

In this work, we introduce, DeepCloth, a unified framework for garment representation, reconstruction, animation and editing.

Garment Reconstruction Position

Paper
Add Code

Vehicle Reconstruction and Texture Estimation Using Deep Implicit Semantic Template Mapping

no code implementations • 30 Nov 2020 • Xiaochen Zhao, Zerong Zheng, Chaonan Ji, Zhenyi Liu, Siyou Lin, Tao Yu, Jinli Suo, Yebin Liu

We introduce VERTEX, an effective solution to recover 3D shape and intrinsic texture of vehicles from uncalibrated monocular input in real-world street environments.

Paper
Add Code

Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL

1 code implementation • 23 Oct 2020 • Yusen Zhang, Xiangyu Dong, Shuaichen Chang, Tao Yu, Peng Shi, Rui Zhang

Neural models have achieved significant results on the text-to-SQL task, in which most current work assumes all the input questions are legal and generates a SQL query for any input.

Text-To-SQL

Paper
Code

Online Conversation Disentanglement with Pointer Networks

no code implementations • EMNLP 2020 • Tao Yu, Shafiq Joty

We also introduce a joint-learning objective to better capture contextual information.

Conversation Disentanglement Disentanglement +1

Paper
Add Code

Semantic Evaluation for Text-to-SQL with Distilled Test Suites

3 code implementations • EMNLP 2020 • Ruiqi Zhong, Tao Yu, Dan Klein

We propose test suite accuracy to approximate semantic accuracy for Text-to-SQL models.

Text-To-SQL

701

Paper
Code

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

1 code implementation • ICLR 2021 • Tao Yu, Chien-Sheng Wu, Xi Victoria Lin, Bailin Wang, Yi Chern Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong

We present GraPPa, an effective pre-training approach for table semantic parsing that learns a compositional inductive bias in the joint representations of textual and tabular data.

Ranked #8 on Semantic Parsing on spider

Inductive Bias Language Modelling +3

Paper
Code

NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image

1 code implementation • ECCV 2020 • Lizhen Wang, Xiaochen Zhao, Tao Yu, Songtao Wang, Yebin Liu

We propose NormalGAN, a fast adversarial learning-based method to reconstruct the complete and detailed 3D human from a single RGB-D image.

3D Human Reconstruction Denoising

Paper
Code

Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration

no code implementations • ECCV 2020 • Xin Li, Xin Jin, Jianxin Lin, Tao Yu, Sen Liu, Yaojun Wu, Wei Zhou, Zhibo Chen

Hybrid-distorted image restoration (HD-IR) is dedicated to restore real distorted image that is degraded by multiple distortions.

Disentanglement Image Restoration

Paper
Add Code

PaMIR: Parametric Model-Conditioned Implicit Representation for Image-based Human Reconstruction

1 code implementation • 8 Jul 2020 • Zerong Zheng, Tao Yu, Yebin Liu, Qionghai Dai

To overcome the limitations of regular 3D representations, we propose Parametric Model-Conditioned Implicit Representation (PaMIR), which combines the parametric body model with the free-form deep implicit function.

Ranked #2 on 3D Human Reconstruction on CAPE

3D Human Reconstruction Camera Calibration

185

Paper
Code

DART: Open-Domain Structured Data Record to Text Generation

2 code implementations • NAACL 2021 • Linyong Nan, Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, Yangxiaokang Liu, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Mutethia Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher, Nazneen Fatema Rajani

Data-to-Text annotations can be a costly process, especially when dealing with tables which are the major source of structured data and contain nontrivial structures.

Domain Generalization Semantic Parsing +2

142

Paper
Code

Semantic Evaluation for Text-to-SQL with Distilled Test Suite

no code implementations • 2 Jul 2020 • Ruiqi Zhong, Tao Yu, Dan Klein

We propose test suite accuracy to approximate semantic accuracy for Text-to-SQL models, where a predicted query is semantically correct if its denotation is the same as the gold for every possible database.

Semantic Parsing Text-To-SQL

Paper
Add Code

MulayCap: Multi-layer Human Performance Capture Using A Monocular Video Camera

no code implementations • 13 Apr 2020 • Zhaoqi Su, Weilin Wan, Tao Yu, Lingjie Liu, Lu Fang, Wenping Wang, Yebin Liu

We introduce MulayCap, a novel human performance capture method using a monocular video camera without the need for pre-scanning.

Paper
Add Code

Robust 3D Self-portraits in Seconds

no code implementations • CVPR 2020 • Zhe Li, Tao Yu, Chuanyu Pan, Zerong Zheng, Yebin Liu

In this paper, we propose an efficient method for robust 3D self-portraits using a single RGBD camera.

Paper
Add Code

Generative Adversarial Networks Based on Collaborative Learning and Attention Mechanism for Hyperspectral Image Classification

no code implementations • Remote Sensing 2020 • Jie Feng, Xueliang Feng, Jiantong Chen, Xianghai Cao, Xiangrong Zhang, Licheng Jiao, Tao Yu

To address this problem, a symmetric convolutional GAN based on collaborative learning and attention mechanism (CA-GAN) is proposed.

Ranked #7 on Hyperspectral Image Classification on Indian Pines

Few-Shot Image Classification Generative Adversarial Network +2

Paper
Add Code

4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras

1 code implementation • CVPR 2020 • Yuxiang Zhang, Liang An, Tao Yu, Xiu Li, Kun Li, Yebin Liu

Our method enables a realtime online motion capture system running at 30fps using 5 cameras on a 5-person scene.

Ranked #8 on 3D Multi-Person Pose Estimation on Shelf

3D Multi-Person Pose Estimation

178

Paper
Code

Salvaging Federated Learning by Local Adaptation

2 code implementations • 12 Feb 2020 • Tao Yu, Eugene Bagdasaryan, Vitaly Shmatikov

First, we show that on standard tasks such as next-word prediction, many participants gain no benefit from FL because the federated model is less accurate on their data than the models they can train locally on their own.

Federated Learning Knowledge Distillation +1

Paper
Code

Numerically Accurate Hyperbolic Embeddings Using Tiling-Based Models

2 code implementations • NeurIPS 2019 • Tao Yu, Christopher M. De Sa

Hyperbolic embeddings achieve excellent performance when embedding hierarchical data structures like synonym or type hierarchies, but they can be limited by numerical error when ordinary floating-point numbers are used to represent points in hyperbolic space.

Paper
Code

Region Normalization for Image Inpainting

1 code implementation • 23 Nov 2019 • Tao Yu, Zongyu Guo, Xin Jin, Shilin Wu, Zhibo Chen, Weiping Li, Zhizheng Zhang, Sen Liu

In this work, we show that the mean and variance shifts caused by full-spatial FN limit the image inpainting network training and we propose a spatial region-wise normalization named Region Normalization (RN) to overcome the limitation.

Image Inpainting

187

Paper
Code

A New Defense Against Adversarial Images: Turning a Weakness into a Strength

1 code implementation • NeurIPS 2019 • Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger

Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images.

Adversarial Defense

Paper
Code

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases

3 code implementations • IJCNLP 2019 • Tao Yu, Rui Zhang, He Yang Er, Suyi Li, Eric Xue, Bo Pang, Xi Victoria Lin, Yi Chern Tan, Tianze Shi, Zihan Li, Youxuan Jiang, Michihiro Yasunaga, Sungrok Shim, Tao Chen, Alexander Fabbri, Zifan Li, Luyao Chen, Yuwen Zhang, Shreya Dixit, Vincent Zhang, Caiming Xiong, Richard Socher, Walter S. Lasecki, Dragomir Radev

We present CoSQL, a corpus for building cross-domain, general-purpose database (DB) querying dialogue systems.

Ranked #8 on Dialogue State Tracking on CoSQL

Dialogue State Tracking Response Generation +1

203

Paper
Code

Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions

3 code implementations • IJCNLP 2019 • Rui Zhang, Tao Yu, He Yang Er, Sungrok Shim, Eric Xue, Xi Victoria Lin, Tianze Shi, Caiming Xiong, Richard Socher, Dragomir Radev

We focus on the cross-domain context-dependent text-to-SQL generation task.

Ranked #5 on Text-To-SQL on SParC

Dialogue State Tracking Text-To-SQL

203

Paper
Code

Progressive Image Inpainting with Full-Resolution Residual Network

2 code implementations • 24 Jul 2019 • Zongyu Guo, Zhibo Chen, Tao Yu, Jiale Chen, Sen Liu

Recently, learning-based algorithms for image inpainting achieve remarkable progress dealing with squared or irregular holes.

Image Inpainting

Paper
Code

SParC: Cross-Domain Semantic Parsing in Context

4 code implementations • ACL 2019 • Tao Yu, Rui Zhang, Michihiro Yasunaga, Yi Chern Tan, Xi Victoria Lin, Suyi Li, Heyang Er, Irene Li, Bo Pang, Tao Chen, Emily Ji, Shreya Dixit, David Proctor, Sungrok Shim, Jonathan Kraft, Vincent Zhang, Caiming Xiong, Richard Socher, Dragomir Radev

The best model obtains an exact match accuracy of 20. 2% over all questions and less than10% over all interaction sequences, indicating that the cross-domain setting and the con-textual phenomena of the dataset present significant challenges for future research.

Semantic Parsing Text-To-SQL

203

Paper
Code

DC-SPP-YOLO: Dense Connection and Spatial Pyramid Pooling Based YOLO for Object Detection

no code implementations • 20 Mar 2019 • Zhanchao Huang, Jianlin Wang, Xuesong Fu, Tao Yu, Yongqi Guo, Rutong Wang

Therefore, a dense connection (DC) and spatial pyramid pooling (SPP) based YOLO (DC-SPP-YOLO) method for ameliorating the object detection accuracy of YOLOv2 is proposed in this paper.

Object object-detection +1

Paper
Add Code

DeepHuman: 3D Human Reconstruction from a Single Image

1 code implementation • ICCV 2019 • Zerong Zheng, Tao Yu, Yixuan Wei, Qionghai Dai, Yebin Liu

We propose DeepHuman, an image-guided volume-to-volume translation CNN for 3D human reconstruction from a single RGB image.

3D Human Reconstruction Pose Estimation +1

409

Paper
Code

SimulCap : Single-View Human Performance Capture with Cloth Simulation

no code implementations • CVPR 2019 • Tao Yu, Zerong Zheng, Yuan Zhong, Jianhui Zhao, Qionghai Dai, Gerard Pons-Moll, Yebin Liu

This paper proposes a new method for live free-viewpoint human performance capture with dynamic details (e. g., cloth wrinkles) using a single RGBD camera.

Paper
Add Code

Simplifying Graph Convolutional Networks

7 code implementations • 19 Feb 2019 • Felix Wu, Tianyi Zhang, Amauri Holanda de Souza Jr., Christopher Fifty, Tao Yu, Kilian Q. Weinberger

Graph Convolutional Networks (GCNs) and their variants have experienced significant attention and have become the de facto methods for learning graph representations.

Ranked #3 on Text Classification on Ohsumed

Graph Regression Image Classification +5

12,977

Paper
Code

SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-DomainText-to-SQL Task

2 code implementations • 11 Oct 2018 • Tao Yu, Michihiro Yasunaga, Kai Yang, Rui Zhang, Dongxu Wang, Zifan Li, Dragomir Radev

In this paper we propose SyntaxSQLNet, a syntax tree network to address the complex and cross-domain text-to-SQL generation task.

Ranked #7 on Text-To-SQL on SParC

Semantic Parsing Text-To-SQL

131

Paper
Code

SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task

no code implementations • EMNLP 2018 • Tao Yu, Michihiro Yasunaga, Kai Yang, Rui Zhang, Dongxu Wang, Zifan Li, Dragomir Radev

In this paper we propose SyntaxSQLNet, a syntax tree network to address the complex and cross-domain text-to-SQL generation task.

Semantic Parsing Text-To-SQL

Paper
Add Code

Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task

5 code implementations • EMNLP 2018 • Tao Yu, Rui Zhang, Kai Yang, Michihiro Yasunaga, Dongxu Wang, Zifan Li, James Ma, Irene Li, Qingning Yao, Shanelle Roman, Zilin Zhang, Dragomir Radev

We define a new complex and cross-domain semantic parsing and text-to-SQL task where different complex SQL queries and databases appear in train and test sets.

Ranked #10 on Semantic Parsing on spider

Semantic Parsing Text-To-SQL

701

Paper
Code

HybridFusion: Real-Time Performance Capture Using a Single Depth Sensor and Sparse IMUs

no code implementations • ECCV 2018 • Zerong Zheng, Tao Yu, Hao Li, Kaiwen Guo, Qionghai Dai, Lu Fang, Yebin Liu

We propose a light-weight and highly robust real-time human performance capture method based on a single depth camera and sparse inertial measurement units (IMUs).

Surface Reconstruction

Paper
Add Code

Knowledge-based Fully Convolutional Network and Its Application in Segmentation of Lung CT Images

no code implementations • 22 May 2018 • Tao Yu, Yu Qiao, Huan Long

A variety of deep neural networks have been applied in medical image segmentation and achieve good performance.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation

1 code implementation • NAACL 2018 • Tao Yu, Zifan Li, Zilin Zhang, Rui Zhang, Dragomir Radev

Interacting with relational databases through natural language helps users of any background easily query and analyze a vast amount of data.

Ranked #2 on Code Generation on WikiSQL

slot-filling Slot Filling +2

111

Paper
Code

DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor

no code implementations • CVPR 2018 • Tao Yu, Zerong Zheng, Kaiwen Guo, Jianhui Zhao, Qionghai Dai, Hao Li, Gerard Pons-Moll, Yebin Liu

We further propose a joint motion tracking method based on the double layer representation to enable robust and fast motion tracking performance.

Dynamic Reconstruction

Paper
Add Code

Curvature-based Comparison of Two Neural Networks

no code implementations • 21 Jan 2018 • Tao Yu, Huan Long, John E. Hopcroft

In this paper we show the similarities and differences of two deep neural networks by comparing the manifolds composed of activation vectors in each fully connected layer of them.

Vocal Bursts Valence Prediction

Paper
Add Code

Preliminary theoretical troubleshooting in Variational Autoencoder

no code implementations • ICLR 2018 • Shiqi Liu, Qian Zhao, Xiangyong Cao, Deyu Meng, Zilu Ma, Tao Yu

This paper tries to preliminarily address VAE's intrinsic dimension, real factor, disentanglement and indicator issues theoretically in the idealistic situation and implementation issue practically through noise modeling perspective in the realistic case.

Disentanglement

Paper
Add Code

The Local Dimension of Deep Manifold

no code implementations • 5 Nov 2017 • Mengxiao Zhang, Wangquan Wu, Yanren Zhang, Kun He, Tao Yu, Huan Long, John E. Hopcroft

Our results show that the dimensions of different categories are close to each other and decline quickly along the convolutional layers and fully connected layers.

Paper
Add Code

BodyFusion: Real-Time Capture of Human Motion and Surface Geometry Using a Single Depth Camera

no code implementations • ICCV 2017 • Tao Yu, Kaiwen Guo, Feng Xu, Yuan Dong, Zhaoqi Su, Jianhui Zhao, Jianguo Li, Qionghai Dai, Yebin Liu

To reduce the ambiguities of the non-rigid deformation parameterization on the surface graph nodes, we take advantage of the internal articulated motion prior for human performance and contribute a skeleton-embedded surface fusion (SSF) method.

Surface Reconstruction

Paper
Add Code

Leveraging Sparse and Dense Feature Combinations for Sentiment Classification

no code implementations • 13 Aug 2017 • Tao Yu, Christopher Hidey, Owen Rambow, Kathleen McKeown

This model outperforms many deep learning models and achieves comparable results to other deep learning models with complex architectures on sentiment analysis datasets.

BIG-bench Machine Learning Classification +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.