no code implementations • 18 Aug 2024 • Ziyu Yao, Xuxin Cheng, Zhiqi Huang
Therefore, we propose a Facial Decoupled Diffusion model for Talking head generation called FD2Talk, which fully leverages the advantages of diffusion models and decouples the complex facial details through multi-stages.
1 code implementation • 2 Jul 2024 • Daking Rai, Yilun Zhou, Shi Feng, Abulhair Saparov, Ziyu Yao
Mechanistic interpretability (MI) is an emerging sub-field of interpretability that seeks to understand a neural network model by reverse-engineering its internal computations.
1 code implementation • 18 Jun 2024 • Daking Rai, Ziyu Yao
Our analyses revealed that the activation of reasoning neurons in the feed-forward layers of an LLM can explain the importance of various components in a CoT prompt, and future research can extend it for a more complete understanding.
1 code implementation • 17 Jun 2024 • Mohamed Aghzal, Erion Plaku, Ziyu Yao
Large Language Models (LLMs) have shown impressive capabilities across a wide variety of tasks.
no code implementations • 10 Apr 2024 • Murong Yue, Wijdane Mifdal, Yixuan Zhang, Jennifer Suh, Ziyu Yao
Mathematical modeling (MM) is considered a fundamental skill for students in STEM disciplines.
no code implementations • 6 Feb 2024 • Qineng Wang, Chen Qian, Xiaochang Li, Ziyu Yao, Huajie Shao
Network traffic refers to the amount of data being sent and received over the internet or any system that connects computers.
1 code implementation • 5 Oct 2023 • Mohamed Aghzal, Erion Plaku, Ziyu Yao
Large language models (LLMs) have achieved remarkable success across a wide spectrum of tasks; however, they still face limitations in scenarios that demand long-term planning and spatial reasoning.
1 code implementation • 4 Oct 2023 • Murong Yue, Jie Zhao, Min Zhang, Liang Du, Ziyu Yao
Large language models (LLMs) such as GPT-4 have exhibited remarkable performance in a variety of tasks, but this strong performance often comes with the high expense of using paid API services.
1 code implementation • 3 Oct 2023 • Saurabh Srivastava, Chengyue Huang, Weiguo Fan, Ziyu Yao
Large language models (LLMs) have revolutionized zero-shot task performance, mitigating the need for task-specific annotations while enhancing task generalizability.
1 code implementation • 8 Aug 2023 • Binfeng Xu, Xukun Liu, Hua Shen, Zeyu Han, Yuhan Li, Murong Yue, Zhiyuan Peng, Yuchen Liu, Ziyu Yao, Dongkuan Xu
We present gentopia, an ALM framework enabling flexible customization of agents through simple configurations, seamlessly integrating various language models, task formats, prompting modules, and plugins into a unified paradigm.
1 code implementation • 27 May 2023 • Daking Rai, Bailin Wang, Yilun Zhou, Ziyu Yao
Compositional and domain generalization present significant challenges in semantic parsing, even for state-of-the-art semantic parsers based on pre-trained language models (LMs).
Ranked #6 on Text-To-SQL on spider
1 code implementation • 22 May 2023 • Saurabh Srivastava, Gaurav Singh, Shou Matsumoto, Ali Raz, Paulo Costa, Joshua Poore, Ziyu Yao
In this work, we present the first dataset, MailEx, for performing event extraction from conversational email threads.
2 code implementations • 14 May 2023 • Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih, Ziyu Yao
In this work, we propose a new task of simulating NL feedback for interactive semantic parsing.
1 code implementation • 15 Mar 2023 • Ziyu Yao, Xuxin Cheng, Yuexian Zou
Moreover, we introduce a pose-level method, PoseRAC, which is based on this representation and achieves state-of-the-art performance on two new version datasets by using Pose Saliency Annotation to annotate salient poses for training.
Ranked #2 on Repetitive Action Counting on RepCount (using extra training data)
no code implementations • 25 Jan 2023 • Daking Rai, Yilun Zhou, Bailin Wang, Ziyu Yao
While large language models (LLMs) have demonstrated strong capability in structured prediction tasks such as semantic parsing, few amounts of research have explored the underlying mechanisms of their success.
1 code implementation • ACL 2022 • Xiang Yue, Ziyu Yao, Huan Sun
Synthesizing QA pairs with a question generator (QG) on the target domain has become a popular approach for domain adaptation of question answering (QA) models.
1 code implementation • 16 Jan 2022 • Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu
Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases.
Ranked #1 on Task-Oriented Dialogue Systems on KVRET
no code implementations • 29 Sep 2021 • Peizhao Li, Xuchao Zhang, Ziyu Yao, Wei Cheng, Haifeng Chen, Hongfu Liu
To achieve this, we propose a machine learning approach to adapt the editorial style derived from few exemplars to a query code snippet.
1 code implementation • ICLR 2021 • Ziyu Yao, Frank F. Xu, Pengcheng Yin, Huan Sun, Graham Neubig
To show the unique benefits of modeling tree edits directly, we further propose a novel edit encoder for learning to represent edits, as well as an imitation learning method that allows the editor to be more robust.
2 code implementations • 30 Oct 2020 • Xiang Yue, Xinliang Frederick Zhang, Ziyu Yao, Simon Lin, Huan Sun
Clinical question answering (QA) aims to automatically answer questions from medical professionals based on clinical texts.
1 code implementation • EMNLP 2020 • Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su
Despite the widely successful applications, bootstrapping and fine-tuning semantic parsers are still a tedious process with challenges such as costly data annotation and privacy risks.
2 code implementations • IJCNLP 2019 • Ziyu Yao, Yu Su, Huan Sun, Wen-tau Yih
As a promising paradigm, interactive semantic parsing has shown to improve both semantic parsing accuracy and user confidence in the results.
1 code implementation • ACL 2019 • Boyuan Pan, Hao Li, Ziyu Yao, Deng Cai, Huan Sun
This paper investigates a new task named Conversational Question Generation (CQG) which is to generate a question based on a passage and a conversation history (i. e., previous turns of question-answer pairs).
1 code implementation • 13 Mar 2019 • Ziyu Yao, Jayavardhan Reddy Peddamail, Huan Sun
In this work, we investigate a novel perspective of Code annotation for Code retrieval (hence called `CoaCor'), where a code annotation model is trained to generate a natural language annotation that can represent the semantic meaning of a given code snippet and can be leveraged by a code retrieval model to better distinguish relevant code snippets from others.
1 code implementation • 21 Aug 2018 • Ziyu Yao, Xiujun Li, Jianfeng Gao, Brian Sadler, Huan Sun
Given a text description, most existing semantic parsers synthesize a program in one shot.
Hierarchical Reinforcement Learning reinforcement-learning +2
1 code implementation • 26 Mar 2018 • Ziyu Yao, Daniel S. Weld, Wei-Peng Chen, Huan Sun
In this paper, we investigate a new problem of systematically mining question-code pairs from Stack Overflow (in contrast to heuristically collecting them).