Search Results for author: Jie Tang

Found 194 papers, 125 papers with code

P-Tuning: Prompt Tuning Can Be Comparable to Fine-tuning Across Scales and Tasks

no code implementations • ACL 2022 • Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Tam, Zhengxiao Du, Zhilin Yang, Jie Tang

Prompt tuning, which only tunes continuous prompts with a frozen language model, substantially reduces per-task storage and memory usage at training.

Language Modelling

Paper
Add Code

HOSMEL: A Hot-Swappable Modularized Entity Linking Toolkit for Chinese

1 code implementation • ACL 2022 • Daniel Zhang-li, Jing Zhang, Jifan Yu, Xiaokang Zhang, Peng Zhang, Jie Tang, Juanzi Li

We investigate the usage of entity linking (EL)in downstream tasks and present the first modularized EL toolkit for easy task adaptation.

Entity Linking Question Answering

Paper
Code

Intelligent Reflecting Surface-Enabled Anti-Detection for Secure Sensing and Communications

no code implementations • 12 Apr 2024 • Beixiong Zheng, Xue Xiong, Tiantian Ma, Jie Tang, Derrick Wing Kwan Ng, A. Lee Swindlehurst, Rui Zhang

The ever-increasing reliance on wireless communication and sensing has led to growing concerns over the vulnerability of sensitive information to unauthorized detection and interception.

Paper
Add Code

BOND: Bootstrapping From-Scratch Name Disambiguation with Multi-task Promoting

2 code implementations • 12 Apr 2024 • Yuqing Cheng, Bo Chen, Fanjin Zhang, Jie Tang

Therefore, we present BOND, which bootstraps the local and global informative signals to promote each other in an end-to-end regime.

Clustering

Paper
Code

Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking

2 code implementations • 10 Apr 2024 • Xiaokang Zhang, Zijun Yao, Jing Zhang, Kaifeng Yun, Jifan Yu, Juanzi Li, Jie Tang

Detecting non-factual content is a longstanding goal to increase the trustworthiness of large language models (LLMs) generations.

Question Answering

144

Paper
Code

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

1 code implementation • 4 Apr 2024 • Hanyu Lai, Xiao Liu, Iat Long Iong, Shuntian Yao, Yuxuan Chen, Pengbo Shen, Hao Yu, Hanchen Zhang, Xiaohan Zhang, Yuxiao Dong, Jie Tang

Large language models (LLMs) have fueled many intelligent agent tasks, such as web navigation -- but most existing agents perform far from satisfying in real-world webpages due to three factors: (1) the versatility of actions on webpages, (2) HTML text exceeding model processing capacity, and (3) the complexity of decision-making due to the open-domain nature of web.

Decision Making Language Modelling +1

287

Paper
Code

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

1 code implementation • 3 Apr 2024 • Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong

Large language models (LLMs) have shown excellent mastering of human language, but still struggle in real-world applications that require mathematical problem-solving.

Math

Paper
Code

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

no code implementations • 1 Apr 2024 • Zhenyu Hou, Yilin Niu, Zhengxiao Du, Xiaohan Zhang, Xiao Liu, Aohan Zeng, Qinkai Zheng, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

The work presents our practices of aligning LLMs with human preferences, offering insights into the challenges and solutions in RLHF implementations.

Paper
Add Code

Extensive Self-Contrast Enables Feedback-Free Language Model Alignment

2 code implementations • 31 Mar 2024 • Xiao Liu, Xixuan Song, Yuxiao Dong, Jie Tang

In this work, we introduce Self-Contrast, a feedback-free large language model alignment method via exploiting extensive self-generated negatives.

Language Modelling Large Language Model +1

19,045

Paper
Code

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

1 code implementation • 28 Mar 2024 • Xiaokang Zhang, Jing Zhang, Zeyao Ma, Yang Li, Bohan Zhang, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-li, Jifan Yu, Shu Zhao, Juanzi Li, Jie Tang

We introduce TableLLM, a robust large language model (LLM) with 13 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios.

Language Modelling Large Language Model

Paper
Code

Understanding Emergent Abilities of Language Models from the Loss Perspective

no code implementations • 23 Mar 2024 • Zhengxiao Du, Aohan Zeng, Yuxiao Dong, Jie Tang

Recent studies have put into question the belief that emergent abilities in language models are exclusive to large models.

Paper
Add Code

A New Intelligent Reflecting Surface-Aided Electromagnetic Stealth Strategy

no code implementations • 19 Mar 2024 • Xue Xiong, Beixiong Zheng, A. Lee Swindlehurst, Jie Tang, Wen Wu

Electromagnetic wave absorbing material (EWAM) plays an essential role in manufacturing stealth aircraft, which can achieve the electromagnetic stealth (ES) by reducing the strength of the signal reflected back to the radar system.

Paper
Add Code

Bilateral Propagation Network for Depth Completion

1 code implementation • 17 Mar 2024 • Jie Tang, Fei-Peng Tian, Boshi An, Jian Li, Ping Tan

Depth completion aims to derive a dense depth map from sparse depth measurements with a synchronized color image.

Depth Completion

Paper
Code

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

no code implementations • 8 Mar 2024 • Wendi Zheng, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong, Ming Ding, Jie Tang

Recent advancements in text-to-image generative systems have been largely driven by diffusion models.

Computational Efficiency Super-Resolution +1

Paper
Add Code

Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

no code implementations • 27 Feb 2024 • Zhen Yang, Ming Ding, Tinglin Huang, Yukuo Cen, Junshuai Song, Bin Xu, Yuxiao Dong, Jie Tang

Is there a general framework that can incorporate all existing negative sampling methods?

Recommendation Systems

Paper
Add Code

PST-Bench: Tracing and Benchmarking the Source of Publications

1 code implementation • 25 Feb 2024 • Fanjin Zhang, Kun Cao, Yukuo Cen, Jifan Yu, Da Yin, Jie Tang

Tracing the source of research papers is a fundamental yet challenging task for researchers.

Benchmarking

Paper
Code

OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining

no code implementations • 24 Feb 2024 • Fanjin Zhang, Shijie Shi, Yifan Zhu, Bo Chen, Yukuo Cen, Jifan Yu, Yelin Chen, Lulu Wang, Qingfei Zhao, Yuqing Cheng, Tianyi Han, Yuwei An, Dan Zhang, Weng Lam Tam, Kun Cao, Yunhe Pang, Xinyu Guan, Huihui Yuan, Jian Song, Xiaoyan Li, Yuxiao Dong, Jie Tang

We envisage that OAG-Bench can serve as a common ground for the community to evaluate and compare algorithms in academic graph mining, thereby accelerating algorithm development and advancement in this field.

Graph Mining

Paper
Add Code

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

no code implementations • 22 Feb 2024 • Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su

The applications of large language models (LLMs) have expanded well beyond the confines of text processing, signaling a new era where LLMs are envisioned as generalist language agents capable of operating within complex real-world environments.

Paper
Add Code

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

no code implementations • 19 Feb 2024 • Zhen Yang, Zhou Shao, Yuxiao Dong, Jie Tang

Negative sampling stands as a pivotal technique in dense retrieval, essential for training effective retrieval models and significantly impacting retrieval performance.

Retrieval

Paper
Add Code

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

1 code implementation • 6 Feb 2024 • Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, Jie Tang

Vision-Language Models (VLMs) have demonstrated their widespread viability thanks to extensive training in aligning visual instructions to answers.

Visual Reasoning

120

Paper
Code

Towards Efficient and Exact Optimization of Language Model Alignment

1 code implementation • 1 Feb 2024 • Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang

We prove that EXO is guaranteed to optimize in the same direction as the RL algorithms asymptotically for arbitary parametrization of the policy, while enables efficient optimization by circumventing the complexities associated with RL algorithms.

Language Modelling Reinforcement Learning (RL)

Paper
Code

LongAlign: A Recipe for Long Context Alignment of Large Language Models

1 code implementation • 31 Jan 2024 • Yushi Bai, Xin Lv, Jiajie Zhang, Yuze He, Ji Qi, Lei Hou, Jie Tang, Yuxiao Dong, Juanzi Li

Extending large language models to effectively handle long contexts requires instruction fine-tuning on input sequences of similar length.

Instruction Following

101

Paper
Code

RecDCL: Dual Contrastive Learning for Recommendation

1 code implementation • 28 Jan 2024 • Dan Zhang, Yangliao Geng, Wenwen Gong, Zhongang Qi, Zhiyu Chen, Xing Tang, Ying Shan, Yuxiao Dong, Jie Tang

In this work, we investigate how to employ both batch-wise CL (BCL) and feature-wise CL (FCL) for recommendation.

Collaborative Filtering Contrastive Learning +2

Paper
Code

Sketch and Refine: Towards Fast and Accurate Lane Detection

1 code implementation • 26 Jan 2024 • Chao Chen, Jie Liu, Chang Zhou, Jie Tang, Gangshan Wu

At the "Sketch" stage, local directions of keypoints can be easily estimated by fast convolutional layers.

Lane Detection

Paper
Code

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

1 code implementation • 15 Jan 2024 • Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang

To bridge these gaps, we introduce SciGLM, a suite of scientific language models able to conduct college-level scientific reasoning.

Math Mathematical Reasoning

Paper
Code

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding

no code implementations • 12 Jan 2024 • Mingdao Liu, Aohan Zeng, Bowen Wang, Peng Zhang, Jie Tang, Yuxiao Dong

The massive adoption of large language models (LLMs) demands efficient deployment strategies.

Paper
Add Code

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein

no code implementations • 11 Jan 2024 • Bo Chen, Xingyi Cheng, Pan Li, Yangli-ao Geng, Jing Gong, Shen Li, Zhilei Bei, Xu Tan, Boyan Wang, Xin Zeng, Chiming Liu, Aohan Zeng, Yuxiao Dong, Jie Tang, Le Song

We propose a unified protein language model, xTrimoPGLM, to address these two types of tasks simultaneously through an innovative pre-training framework.

Protein Language Model

Paper
Add Code

CogCartoon: Towards Practical Story Visualization

no code implementations • 17 Dec 2023 • Zhongyang Zhu, Jie Tang

The state-of-the-art methods for story visualization demonstrate a significant demand for training data and storage, as well as limited flexibility in story presentation, thereby rendering them impractical for real-world applications.

Story Visualization

Paper
Add Code

CogAgent: A Visual Language Model for GUI Agents

1 code implementation • 14 Dec 2023 • Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

People are spending an enormous amount of time on digital devices through graphical user interfaces (GUIs), e. g., computer or smartphone screens.

Ranked #14 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

4,989

Paper
Code

Intelligent Reflecting Surface-Aided Electromagnetic Stealth Against Radar Detection

no code implementations • 4 Dec 2023 • Beixiong Zheng, Xue Xiong, Jie Tang, Rui Zhang

While traditional electromagnetic stealth materials/metasurfaces can render a target virtually invisible to some extent, they lack flexibility and adaptability, and can only operate within a limited frequency and angle/direction range, making it challenging to ensure the expected stealth performance.

Paper
Add Code

AlignBench: Benchmarking Chinese Alignment of Large Language Models

1 code implementation • 30 Nov 2023 • Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Hongning Wang, Jing Zhang, Minlie Huang, Yuxiao Dong, Jie Tang

We will provide public APIs for evaluating AlignBench with CritiqueLLM to facilitate the evaluation of LLMs' Chinese alignment.

Benchmarking

190

Paper
Code

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

2 code implementations • 30 Nov 2023 • Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

Since the natural language processing (NLP) community started to make large language models (LLMs), such as GPT-4, act as a critic to evaluate the quality of generated texts, most of them only train a critique generation model of a specific scale on specific datasets.

Language Modelling Large Language Model

190

Paper
Code

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

1 code implementation • 28 Nov 2023 • Jinfeng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wenjing Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

In this paper, we present CharacterGLM, a series of models built upon ChatGLM, with model sizes ranging from 6B to 66B parameters.

Dialogue Generation

310

Paper
Code

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

1 code implementation • 7 Nov 2023 • Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang

However, these models are often not well aligned with human intents, which calls for additional treatments on them, that is, the alignment problem.

248

Paper
Code

CogVLM: Visual Expert for Pretrained Language Models

1 code implementation • 6 Nov 2023 • Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang

We introduce CogVLM, a powerful open-source visual language foundation model.

Ranked #4 on Visual Question Answering (VQA) on InfiMM-Eval

Language Modelling Visual Question Answering

4,989

Paper
Code

CVPR 2023 Text Guided Video Editing Competition

1 code implementation • 24 Oct 2023 • Jay Zhangjie Wu, Xiuyu Li, Difei Gao, Zhen Dong, Jinbin Bai, Aishani Singh, Xiaoyu Xiang, Youzeng Li, Zuwei Huang, Yuanxi Sun, Rui He, Feng Hu, Junhua Hu, Hai Huang, Hanyu Zhu, Xu Cheng, Jie Tang, Mike Zheng Shou, Kurt Keutzer, Forrest Iandola

In this paper we present a retrospective on the competition and describe the winning method.

Video Editing Video Generation

Paper
Code

AgentTuning: Enabling Generalized Agent Abilities for LLMs

1 code implementation • 19 Oct 2023 • Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang

Though many prompting methods have been proposed to complete particular agent tasks, there is lack of research focusing on improving the agent capabilities of LLMs themselves without compromising their general abilities.

Memorization

1,224

Paper
Code

Synthetic IMU Datasets and Protocols Can Simplify Fall Detection Experiments and Optimize Sensor Configuration

no code implementations • 16 Oct 2023 • Jie Tang, Bin He, Junkai Xu, Tian Tan, Zhipeng Wang, Yanmin Zhou, Shuo Jiang

The proposed method simplifies fall detection data acquisition experiments, provides novel venue for generating low cost synthetic data in scenario where acquiring data for machine learning is challenging and paves the way for customizing machine learning configurations.

Paper
Add Code

MBIR Training for a 2.5D DL network in X-ray CT

no code implementations • 23 Sep 2023 • Obaidullah Rahman, Madhuri Nagare, Ken D. Sauer, Charles A. Bouman, Roman Melnyk, Brian Nett, Jie Tang

The cost we have to pay is that MBIR is computationally expensive.

Paper
Add Code

Design of Novel Loss Functions for Deep Learning in X-ray CT

no code implementations • 23 Sep 2023 • Obaidullah Rahman, Ken D. Sauer, Madhuri Nagare, Charles A. Bouman, Roman Melnyk, Jie Tang, Brian Nett

Particularly in a field such as X-ray CT, where radiologists' subjective preferences in image characteristics are key to acceptance, it may be desirable to penalize differences in DL more creatively.

Computed Tomography (CT)

Paper
Add Code

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions

1 code implementation • 13 Sep 2023 • Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang

Notably, SafetyBench also incorporates both Chinese and English data, facilitating the evaluation in both languages.

Multiple-choice

101

Paper
Code

GPT Can Solve Mathematical Problems Without a Calculator

1 code implementation • 6 Sep 2023 • Zhen Yang, Ming Ding, Qingsong Lv, Zhihuan Jiang, Zehai He, Yuyi Guo, Jinfeng Bai, Jie Tang

Previous studies have typically assumed that large language models are unable to accurately perform arithmetic operations, particularly multiplication of >8 digits, and operations involving decimals and fractions, without the use of calculator tools.

Language Modelling Math

308

Paper
Code

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

1 code implementation • 4 Sep 2023 • Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang

Diffusion models achieved great success in image synthesis, but still face challenges in high-resolution generation.

Ranked #1 on Image Generation on CelebA-HQ 256x256

Image Generation

225

Paper
Code

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

1 code implementation • 28 Aug 2023 • Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li

In this paper, we introduce LongBench, the first bilingual, multi-task benchmark for long context understanding, enabling a more rigorous evaluation of long context understanding.

16k Code Completion +2

466

Paper
Code

Robust Object Modeling for Visual Tracking

1 code implementation • ICCV 2023 • Yidong Cai, Jie Liu, Jie Tang, Gangshan Wu

To enjoy the merits of both methods, we propose a robust object modeling framework for visual tracking (ROMTrack), which simultaneously models the inherent template and the hybrid template features.

Object Visual Tracking

Paper
Code

AgentBench: Evaluating LLMs as Agents

1 code implementation • 7 Aug 2023 • Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, Yuxiao Dong, Jie Tang

We present AgentBench, a multi-dimensional evolving benchmark that currently consists of 8 distinct environments to assess LLM-as-Agent's reasoning and decision-making abilities in a multi-turn open-ended generation setting.

Decision Making Instruction Following

1,831

Paper
Code

Lightweight Super-Resolution Head for Human Pose Estimation

1 code implementation • 31 Jul 2023 • Haonan Wang, Jie Liu, Jie Tang, Gangshan Wu

We first propose the SR head, which predicts heatmaps with a spatial resolution higher than the input feature maps (or even consistent with the input image) by super-resolution, to effectively reduce the quantization error and the dependence on further post-processing.

Pose Estimation Quantization +1

Paper
Code

MAE-GEBD:Winning the CVPR'2023 LOVEU-GEBD Challenge

1 code implementation • 27 Jun 2023 • Yuanxi Sun, Rui He, Youzeng Li, Zuwei Huang, Feng Hu, Xu Cheng, Jie Tang

The Generic Event Boundary Detection (GEBD) task aims to build a model for segmenting videos into segments by detecting general event boundaries applicable to various classes.

Boundary Detection Generic Event Boundary Detection +2

Paper
Code

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

1 code implementation • 15 Jun 2023 • Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi, Hailong Jin, Jinxin Liu, Yu Gu, Yuan YAO, Ning Ding, Lei Hou, Zhiyuan Liu, Bin Xu, Jie Tang, Juanzi Li

The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations.

Benchmarking Hallucination +1

Paper
Code

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

2 code implementations • 13 Jun 2023 • Xiao Liu, Hanyu Lai, Hao Yu, Yifan Xu, Aohan Zeng, Zhengxiao Du, Peng Zhang, Yuxiao Dong, Jie Tang

We present WebGLM, a web-enhanced question-answering system based on the General Language Model (GLM).

Language Modelling Large Language Model +2

1,505

Paper
Code

GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

1 code implementation • 11 Jun 2023 • Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Yang Yang, Hongyin Tang, Keqing He, Jiahao Liu, Jingang Wang, Shu Zhao, Peng Zhang, Jie Tang

Currently, the reduction in the parameter scale of large-scale pre-trained language models (PLMs) through knowledge distillation has greatly facilitated their widespread deployment on various devices.

General Knowledge Knowledge Distillation +1

Paper
Code

Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method

1 code implementation • 11 Jun 2023 • Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Shu Zhao, Peng Zhang, Jie Tang

To address these problems, we propose a general language model distillation (GLMD) method that performs two-stage word prediction distillation and vocabulary compression, which is simple and surprisingly shows extremely strong performance.

Knowledge Distillation Language Modelling

Paper
Code

BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs

1 code implementation • 6 Jun 2023 • Zhen Yang, Tinglin Huang, Ming Ding, Yuxiao Dong, Rex Ying, Yukuo Cen, Yangliao Geng, Jie Tang

To make each mini-batch have fewer false negatives, we design the proximity graph of randomly-selected instances.

Contrastive Learning STS

Paper
Code

Optimizing Airbnb Search Journey with Multi-task Learning

no code implementations • 28 May 2023 • Chun How Tan, Austin Chan, Malay Haldar, Jie Tang, Xin Liu, Mustafa Abdool, Huiji Gao, Liwei He, Sanjeev Katariya

The long and exploratory nature of the search journey, as well as the need to balance both guest and host preferences, present unique challenges for Airbnb search ranking.

Multi-Task Learning

Paper
Add Code

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration

no code implementations • 24 May 2023 • Kejuan Yang, Xiao Liu, Kaiwen Men, Aohan Zeng, Yuxiao Dong, Jie Tang

We identify two crucial limitations in the evaluation of recent parallel-integrated method Parallel Context Windows (PCW), which extends the maximum context lengths of language models, e. g., 2048 for LLaMA, by harnessing window-wise attention and positional embedding techniques.

Long-Context Understanding

Paper
Add Code

Video Frame Interpolation with Densely Queried Bilateral Correlation

1 code implementation • 26 Apr 2023 • Chang Zhou, Jie Liu, Jie Tang, Gangshan Wu

To better model correlations and to produce more accurate motion fields, we propose the Densely Queried Bilateral Correlation (DQBC) that gets rid of the receptive field dependency problem and thus is more friendly to small and fast-moving objects.

Ranked #1 on Video Frame Interpolation on MSU Video Frame Interpolation (VMAF metric)

Motion Estimation Video Frame Interpolation

Paper
Code

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation

1 code implementation • NeurIPS 2023 • Jiazheng Xu, Xiao Liu, Yuchen Wu, Yuxuan Tong, Qinkai Li, Ming Ding, Jie Tang, Yuxiao Dong

We present a comprehensive solution to learn and improve text-to-image models from human preference feedback.

Text-to-Image Generation

938

Paper
Code

GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner

2 code implementations • 10 Apr 2023 • Zhenyu Hou, Yufei He, Yukuo Cen, Xiao Liu, Yuxiao Dong, Evgeny Kharlamov, Jie Tang

Graph self-supervised learning (SSL), including contrastive and generative approaches, offers great potential to address the fundamental challenge of label scarcity in real-world graph data.

Self-Supervised Learning

419

Paper
Code

MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs

1 code implementation • 5 Apr 2023 • Jifan Yu, Mengying Lu, Qingyang Zhong, Zijun Yao, Shangqing Tu, Zhengshan Liao, Xiaoya Li, Manli Li, Lei Hou, Hai-Tao Zheng, Juanzi Li, Jie Tang

Student modeling, the task of inferring a student's learning characteristics through their interactions with coursework, is a fundamental issue in intelligent education.

cognitive diagnosis Knowledge Tracing

Paper
Code

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X

2 code implementations • 30 Mar 2023 • Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang

Large pre-trained code generation models, such as OpenAI Codex, can generate syntax- and function-correct code, making the coding of programmers more productive and our pursuit of artificial general intelligence closer.

Ranked #81 on Code Generation on MBPP

Code Generation

7,767

Paper
Code

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation

1 code implementation • 26 Mar 2023 • Ji Qi, Jifan Yu, Teng Tu, Kunyu Gao, Yifan Xu, Xinyu Guan, Xiaozhi Wang, Yuxiao Dong, Bin Xu, Lei Hou, Juanzi Li, Jie Tang, Weidong Guo, Hui Liu, Yu Xu

Despite the recent emergence of video captioning models, how to generate vivid, fine-grained video descriptions based on the background knowledge (i. e., long and informative commentary about the domain-specific scenes with appropriate reasoning) is still far from being solved, which however has great applications such as automatic sports narrative.

Video Captioning

Paper
Code

GPT-4 Technical Report

9 code implementations • Preprint 2023 • OpenAI, :, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-Luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-Lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob McGrew, Scott Mayer McKinney, Christine McLeavey, Paul McMillan, Jake McNeil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'Keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe de Avila Belbute Peres, Michael Petrov, Henrique Ponde de Oliveira Pinto, Michael, Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, CJ Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs.

Ranked #1 on Long-Context Understanding on Ada-LEval (BestAnswer)

Arithmetic Reasoning Bug fixing +10

13,888

Paper
Code

GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

1 code implementation • 28 Feb 2023 • Jing Zhang, Xiaokang Zhang, Daniel Zhang-li, Jifan Yu, Zijun Yao, Zeyao Ma, Yiqi Xu, Haohua Wang, Xiaohan Zhang, Nianyi Lin, Sunrui Lu, Juanzi Li, Jie Tang

We present GLM-Dialog, a large-scale language model (LLM) with 10B parameters capable of knowledge-grounded conversation in Chinese using a search engine to access the Internet knowledge.

Dialogue Evaluation Dialogue Generation +2

Paper
Code

Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit

1 code implementation • 23 Feb 2023 • Bo Chen, Jing Zhang, Fanjin Zhang, Tianyi Han, Yuqing Cheng, Xiaoyan Li, Yuxiao Dong, Jie Tang

The toolkit is at https://github. com/THUDM/WhoIsWho.

Data Integration

Paper
Code

Scaling laws for single-agent reinforcement learning

no code implementations • 31 Jan 2023 • Jacob Hilton, Jie Tang, John Schulman

Recent work has shown that, in generative modeling, cross-entropy loss improves smoothly with model size and training compute, following a power law plus constant scaling law.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

From Coarse to Fine: Hierarchical Pixel Integration for Lightweight Image Super-Resolution

1 code implementation • 30 Nov 2022 • Jie Liu, Chao Chen, Jie Tang, Gangshan Wu

In the fine area, we use an Intra-Patch Self-Attention (IPSA) module to model long-range pixel dependencies in a local patch, and then a $3\times3$ convolution is applied to process the finest details.

Image Super-Resolution

Paper
Code

AICOM-MP: an AI-based Monkeypox Detector for Resource-Constrained Environments

no code implementations • 21 Nov 2022 • Tim Tianyi Yang, Tom Tianze Yang, Andrew Liu, Jie Tang, Na An, Shaoshan Liu, Xue Liu

Also, through the AICOM-MP project, we have generalized a methodology of developing health AI technologies for AMCs to allow universal access even in resource-constrained environments.

Paper
Add Code

Parameter-Efficient Tuning Makes a Good Classification Head

1 code implementation • 30 Oct 2022 • Zhuoyi Yang, Ming Ding, Yanhui Guo, Qingsong Lv, Jie Tang

In this paper, we find that parameter-efficient tuning makes a good classification head, with which we can simply replace the randomly initialized heads for a stable performance gain.

Classification Natural Language Understanding

Paper
Code

GLM-130B: An Open Bilingual Pre-trained Model

10 code implementations • 5 Oct 2022 • Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, WenGuang Chen, Peng Zhang, Yuxiao Dong, Jie Tang

We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters.

Ranked #1 on Language Modelling on CLUE (OCNLI_50K)

Language Modelling Long-Context Understanding +2

39,246

Paper
Code

Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

1 code implementation • 16 Aug 2022 • Xiao Liu, Shiyu Zhao, Kai Su, Yukuo Cen, Jiezhong Qiu, Mengdi Zhang, Wei Wu, Yuxiao Dong, Jie Tang

In this work, we present the Knowledge Graph Transformer (kgTransformer) with masked pre-training and fine-tuning strategies.

Paper
Code

Towards a General Pre-training Framework for Adaptive Learning in MOOCs

1 code implementation • 18 Jul 2022 • Qingyang Zhong, Jifan Yu, Zheyuan Zhang, Yiming Mao, Yuquan Wang, Yankai Lin, Lei Hou, Juanzi Li, Jie Tang

Adaptive learning aims to stimulate and meet the needs of individual learners, which requires sophisticated system-level coordination of diverse tasks, including modeling learning resources, estimating student states, and making personalized recommendations.

Knowledge Tracing

Paper
Code

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

2 code implementations • 14 Jul 2022 • Weng Lam Tam, Xiao Liu, Kaixuan Ji, Lilong Xue, Xingjian Zhang, Yuxiao Dong, Jiahua Liu, Maodi Hu, Jie Tang

By updating only 0. 1% of the model parameters, the prompt tuning strategy can help retrieval models achieve better generalization performance than traditional methods in which all parameters are updated.

Retrieval Text Retrieval +1

1,885

Paper
Code

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

2 code implementations • 23 Jun 2022 • Bowen Baker, Ilge Akkaya, Peter Zhokhov, Joost Huizinga, Jie Tang, Adrien Ecoffet, Brandon Houghton, Raul Sampedro, Jeff Clune

Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities.

Imitation Learning reinforcement-learning +1

1,206

Paper
Code

GACT: Activation Compressed Training for Generic Network Architectures

1 code implementation • 22 Jun 2022 • Xiaoxuan Liu, Lianmin Zheng, Dequan Wang, Yukuo Cen, Weize Chen, Xu Han, Jianfei Chen, Zhiyuan Liu, Jie Tang, Joey Gonzalez, Michael Mahoney, Alvin Cheung

Training large neural network (NN) models requires extensive memory resources, and Activation Compressed Training (ACT) is a promising approach to reduce training memory footprint.

Paper
Code

Masked Autoencoders for Generic Event Boundary Detection CVPR'2022 Kinetics-GEBD Challenge

1 code implementation • 17 Jun 2022 • Rui He, Yuanxi Sun, Youzeng Li, Zuwei Huang, Feng Hu, Xu Cheng, Jie Tang

In this paper, we apply Masked Autoencoders to improve algorithm performance on the GEBD tasks.

Boundary Detection Generic Event Boundary Detection +1

Paper
Code

CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

1 code implementation • 29 May 2022 • Wenyi Hong, Ming Ding, Wendi Zheng, Xinghan Liu, Jie Tang

Large-scale pretrained transformers have created milestones in text (GPT-3) and text-to-image (DALL-E and CogView) generation.

Ranked #12 on Video Generation on UCF-101

Text-to-Video Generation Video Generation

3,488

Paper
Code

Rethinking the Setting of Semi-supervised Learning on Graphs

1 code implementation • 28 May 2022 • Ziang Li, Ming Ding, Weikai Li, Zihan Wang, Ziyu Zeng, Yukuo Cen, Jie Tang

graph benchmark (IGB) consisting of 4 datasets.

Paper
Code

GraphMAE: Self-Supervised Masked Graph Autoencoders

3 code implementations • 22 May 2022 • Zhenyu Hou, Xiao Liu, Yukuo Cen, Yuxiao Dong, Hongxia Yang, Chunjie Wang, Jie Tang

Despite this, contrastive learning-which heavily relies on structural data augmentation and complicated training strategies-has been the dominant approach in graph SSL, while the progress of generative SSL on graphs, especially graph autoencoders (GAEs), has thus far not reached the potential as promised in other fields.

Ranked #1 on Node Classification on Cora: fixed 20 node per class

Contrastive Learning Graph Classification +4

419

Paper
Code

DeepStruct: Pretraining of Language Models for Structure Prediction

1 code implementation • Findings (ACL) 2022 • Chenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang, Dawn Song

We introduce a method for improving the structural understanding abilities of language models.

Ranked #1 on Open Information Extraction on Penn Treebank

coreference-resolution Dialogue State Tracking +11

Paper
Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

117

Paper
Code

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers

1 code implementation • 28 Apr 2022 • Ming Ding, Wendi Zheng, Wenyi Hong, Jie Tang

The development of the transformer-based text-to-image models are impeded by its slow generation and complexity for high-resolution images.

Ranked #44 on Text-to-Image Generation on MS COCO

Language Modelling Super-Resolution +1

929

Paper
Code

Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution

1 code implementation • 18 Apr 2022 • Zongcai Du, Ding Liu, Jie Liu, Jie Tang, Gangshan Wu, Lean Fu

Besides, FMEN-S achieves the lowest memory consumption and the second shortest runtime in NTIRE 2022 challenge on efficient super-resolution.

Image Super-Resolution

Paper
Code

A Roadmap for Big Model

no code implementations • 26 Mar 2022 • Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, HuaWei Shen, HUI ZHANG, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan YAO, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, LiWei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm.

Language Modelling Machine Translation +1

Paper
Add Code

WuDaoMM: A large-scale Multi-Modal Dataset for Pre-training models

no code implementations • 22 Mar 2022 • Sha Yuan, Shuai Zhao, Jiahong Leng, Zhao Xue, Hanyu Zhao, Peiyu Liu, Zheng Gong, Wayne Xin Zhao, Junyi Li, Jie Tang

The results show that WuDaoMM can be applied as an efficient dataset for VLPMs, especially for the model in text-to-image generation task.

Image Captioning Question Answering +2

Paper
Add Code

Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

1 code implementation • 14 Mar 2022 • Ning Ding, Yujia Qin, Guang Yang, Fuchao Wei, Zonghan Yang, Yusheng Su, Shengding Hu, Yulin Chen, Chi-Min Chan, Weize Chen, Jing Yi, Weilin Zhao, Xiaozhi Wang, Zhiyuan Liu, Hai-Tao Zheng, Jianfei Chen, Yang Liu, Jie Tang, Juanzi Li, Maosong Sun

This necessitates a new branch of research focusing on the parameter-efficient adaptation of PLMs, dubbed as delta tuning in this paper.

Text Classification

938

Paper
Code

GRAND+: Scalable Graph Random Neural Networks

1 code implementation • 12 Mar 2022 • Wenzheng Feng, Yuxiao Dong, Tinglin Huang, Ziqi Yin, Xu Cheng, Evgeny Kharlamov, Jie Tang

In this work, we present a scalable and high-performance GNN framework GRAND+ for semi-supervised graph learning.

Ranked #1 on Node Classification on MAG-scholar-C

Data Augmentation Graph Learning +2

Paper
Code

Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks

no code implementations • 8 Mar 2022 • Jibing Gong, Yao Wan, Ye Liu, Xuewen Li, Yi Zhao, Cheng Wang, YuTing Lin, Xiaohan Fang, Wenzheng Feng, Jingyi Zhang, Jie Tang

Despite the usefulness of this service, we consider that recommending courses to users directly may neglect their varying degrees of expertise.

Graph Attention reinforcement-learning +1

Paper
Add Code

SelfKG: Self-Supervised Entity Alignment in Knowledge Graphs

1 code implementation • 2 Mar 2022 • Xiao Liu, Haoyun Hong, Xinghao Wang, Zeyi Chen, Evgeny Kharlamov, Yuxiao Dong, Jie Tang

We present SelfKG with efficient strategies to optimize this objective for aligning entities without label supervision.

Entity Alignment Knowledge Graphs +1

Paper
Code

Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering

1 code implementation • ACL 2022 • Jing Zhang, Xiaokang Zhang, Jifan Yu, Jian Tang, Jie Tang, Cuiping Li, Hong Chen

Recent works on knowledge base question answering (KBQA) retrieve subgraphs for easier reasoning.

Knowledge Base Question Answering Retrieval

Paper
Code

Training Free Graph Neural Networks for Graph Matching

1 code implementation • 14 Jan 2022 • Zhiyuan Liu, Yixin Cao, Fuli Feng, Xiang Wang, Jie Tang, Kenji Kawaguchi, Tat-Seng Chua

We present a framework of Training Free Graph Matching (TFGM) to boost the performance of Graph Neural Networks (GNNs) based graph matching, providing a fast promising solution without training (training-free).

Entity Alignment Graph Matching +1

Paper
Code

BodyGAN: General-Purpose Controllable Neural Human Body Generation

no code implementations • CVPR 2022 • Chaojie Yang, Hanhui Li, Shengjie Wu, Shengkai Zhang, Haonan Yan, Nianhong Jiao, Jie Tang, Runnan Zhou, Xiaodan Liang, Tianxiang Zheng

This is because current methods mainly rely on a single pose/appearance model, which is limited in disentangling various poses and appearance in human images.

Disentanglement Image Generation +1

Paper
Add Code

Are we really making much progress? Revisiting, benchmarking, and refining heterogeneous graph neural networks

1 code implementation • 30 Dec 2021 • Qingsong Lv, Ming Ding, Qiang Liu, Yuxiang Chen, Wenzheng Feng, Siming He, Chang Zhou, Jianguo Jiang, Yuxiao Dong, Jie Tang

Heterogeneous graph neural networks (HGNNs) have been blossoming in recent years, but the unique data processing and evaluation setups used by each work obstruct a full understanding of their advancements.

Benchmarking

285

Paper
Code

SCR: Training Graph Neural Networks with Consistency Regularization

4 code implementations • 8 Dec 2021 • Chenhui Zhang, Yufei He, Yukuo Cen, Zhenyu Hou, Wenzheng Feng, Yuxiao Dong, Xu Cheng, Hongyun Cai, Feng He, Jie Tang

However, it is unclear how to best design the generalization strategies in GNNs, as it works in a semi-supervised setting for graph data.

Ranked #3 on Node Property Prediction on ogbn-papers100M

Node Classification Node Property Prediction

Paper
Code

Adaptive Diffusion in Graph Neural Networks

no code implementations • NeurIPS 2021 • Jialin Zhao, Yuxiao Dong, Ming Ding, Evgeny Kharlamov, Jie Tang

Notably, message passing based GNNs, e. g., graph convolutional networks, leverage the immediate neighbors of each node during the aggregation process, and recently, graph diffusion convolution (GDC) is proposed to expand the propagation neighborhood by leveraging generalized graph diffusion.

Paper
Add Code

A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems

no code implementations • NeurIPS 2021 • Yi Ma, Xiaotian Hao, Jianye Hao, Jiawen Lu, Xing Liu, Tong Xialiang, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng

To address this problem, existing methods partition the overall DPDP into fixed-size sub-problems by caching online generated orders and solve each sub-problem, or on this basis to utilize the predicted future orders to optimize each sub-problem further.

Hierarchical Reinforcement Learning

Paper
Add Code

AdaDM: Enabling Normalization for Image Super-Resolution

1 code implementation • 27 Nov 2021 • Jie Liu, Jie Tang, Gangshan Wu

We found that the standard deviation of the residual feature shrinks a lot after normalization layers, which causes the performance degradation in SR networks.

Image Super-Resolution

Paper
Code

Network representation learning: A macro and micro view

no code implementations • 21 Nov 2021 • Xueyi Liu, Jie Tang

Representation learning can facilitate the design of new algorithms on the graph data.

Network Embedding

Paper
Add Code

Calculating Question Similarity is Enough: A New Method for KBQA Tasks

no code implementations • 15 Nov 2021 • Hanyu Zhao, Sha Yuan, Jiahong Leng, Xiang Pan, Guoqiang Wang, Ledell Wu, Jie Tang

Knowledge Base Question Answering (KBQA) aims to answer natural language questions with the help of an external knowledge base.

Entity Linking Knowledge Base Question Answering +3

Paper
Add Code

Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning

1 code implementation • 8 Nov 2021 • Qinkai Zheng, Xu Zou, Yuxiao Dong, Yukuo Cen, Da Yin, Jiarong Xu, Yang Yang, Jie Tang

To bridge this gap, we present the Graph Robustness Benchmark (GRB) with the goal of providing a scalable, unified, modular, and reproducible evaluation for the adversarial robustness of GML models.

Adversarial Robustness Benchmarking +1

Paper
Code

P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks

4 code implementations • 14 Oct 2021 • Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang

Prompt tuning, which only tunes continuous prompts with a frozen language model, substantially reduces per-task storage and memory usage at training.

Language Modelling

2,451

Paper
Code

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding

1 code implementation • ACL 2022 • Yanan Zheng, Jing Zhou, Yujie Qian, Ming Ding, Chonghua Liao, Jian Li, Ruslan Salakhutdinov, Jie Tang, Sebastian Ruder, Zhilin Yang

The few-shot natural language understanding (NLU) task has attracted much recent attention.

Benchmarking Natural Language Understanding

Paper
Code

Zero-Shot Information Extraction as a Unified Text-to-Triple Translation

1 code implementation • EMNLP 2021 • Chenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang, Dawn Song

We cast a suite of information extraction tasks into a text-to-triple translation framework.

Ranked #1 on Open Information Extraction on OIE2016 (using extra training data)

Factual probe Language Modelling +3

105

Paper
Code

Graph Contrastive Learning for Anomaly Detection

2 code implementations • 17 Aug 2021 • Bo Chen, Jing Zhang, Xiaokang Zhang, Yuxiao Dong, Jian Song, Peng Zhang, Kaibo Xu, Evgeny Kharlamov, Jie Tang

To achieve the contrastive objective, we design a graph neural network encoder that can infer and further remove suspicious links during message passing, as well as learn the global context of the input graph.

Anomaly Detection Binary Classification +2

Paper
Code

Modeling Protein Using Large-scale Pretrain Language Model

1 code implementation • 17 Aug 2021 • Yijia Xiao, Jiezhong Qiu, Ziang Li, Chang-Yu Hsieh, Jie Tang

The emergence of deep learning models makes modeling data patterns in large quantities of data possible.

Drug Discovery Language Modelling

103

Paper
Code

FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning

1 code implementation • ACL 2022 • Jing Zhou, Yanan Zheng, Jie Tang, Jian Li, Zhilin Yang

Most previous methods for text data augmentation are limited to simple tasks and weak baselines.

Data Augmentation Few-Shot Learning +1

Paper
Code

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

2 code implementations • 3 Aug 2021 • Hao Zhou, Pei Ke, Zheng Zhang, Yuxian Gu, Yinhe Zheng, Chujie Zheng, Yida Wang, Chen Henry Wu, Hao Sun, Xiaocong Yang, Bosi Wen, Xiaoyan Zhu, Minlie Huang, Jie Tang

Although pre-trained language models have remarkably enhanced the generation ability of dialogue systems, open-domain Chinese dialogue systems are still limited by the dialogue data and the model size compared with English ones.

565

Paper
Code

Evaluating Large Language Models Trained on Code

13 code implementations • 7 Jul 2021 • Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-Voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Sam McCandlish, Ilya Sutskever, Wojciech Zaremba

We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities.

Ranked #1 on Multi-task Language Understanding on BBH-alg

Code Generation Language Modelling +1

7,767

Paper
Code

Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

no code implementations • 28 Jun 2021 • Ingmar Kanitscheider, Joost Huizinga, David Farhi, William Hebgen Guss, Brandon Houghton, Raul Sampedro, Peter Zhokhov, Bowen Baker, Adrien Ecoffet, Jie Tang, Oleg Klimov, Jeff Clune

An important challenge in reinforcement learning is training agents that can solve a wide variety of tasks.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Turing Award elites revisited: patterns of productivity, collaboration, authorship and impact

no code implementations • 22 Jun 2021 • Yinyu Jin, Sha Yuan, Zhou Shao, Wendy Hall, Jie Tang

The Turing Award is recognized as the most influential and prestigious award in the field of computer science(CS).

Paper
Add Code

Cascaded Channel Estimation for RIS Assisted mmWave MIMO Transmissions

no code implementations • 19 Jun 2021 • Yushan Liu, Shun Zhang, Feifei Gao, Jie Tang, Octavia A. Dobre

Channel estimation is challenging for the reconfigurable intelligence surface (RIS) assisted millimeter wave (mmWave) communications.

Paper
Add Code

A Self-supervised Method for Entity Alignment

1 code implementation • 17 Jun 2021 • Xiao Liu, Haoyun Hong, Xinghao Wang, Zeyi Chen, Evgeny Kharlamov, Yuxiao Dong, Jie Tang

We present SelfKG by leveraging this discovery to design a contrastive learning strategy across two KGs.

Contrastive Learning Entity Alignment +2

Paper
Code

Pre-Trained Models: Past, Present and Future

no code implementations • 14 Jun 2021 • Xu Han, Zhengyan Zhang, Ning Ding, Yuxian Gu, Xiao Liu, Yuqi Huo, Jiezhong Qiu, Yuan YAO, Ao Zhang, Liang Zhang, Wentao Han, Minlie Huang, Qin Jin, Yanyan Lan, Yang Liu, Zhiyuan Liu, Zhiwu Lu, Xipeng Qiu, Ruihua Song, Jie Tang, Ji-Rong Wen, Jinhui Yuan, Wayne Xin Zhao, Jun Zhu

Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success and become a milestone in the field of artificial intelligence (AI).

Computational Efficiency Self-Supervised Learning +1

Paper
Add Code

TDGIA:Effective Injection Attacks on Graph Neural Networks

1 code implementation • 12 Jun 2021 • Xu Zou, Qinkai Zheng, Yuxiao Dong, Xinyu Guan, Evgeny Kharlamov, Jialiang Lu, Jie Tang

In the GIA scenario, the adversary is not able to modify the existing link structure and node attributes of the input graph, instead the attack is performed by injecting adversarial nodes into it.

Adversarial Attack

Paper
Code

A Generalizable Approach to Learning Optimizers

1 code implementation • 2 Jun 2021 • Diogo Almeida, Clemens Winter, Jie Tang, Wojciech Zaremba

A core issue with learning to optimize neural networks has been the lack of generalization to real world problems.

Language Modelling

Paper
Code

M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers

no code implementations • NeurIPS 2021 • Zhu Zhang, Jianxin Ma, Chang Zhou, Rui Men, Zhikang Li, Ming Ding, Jie Tang, Jingren Zhou, Hongxia Yang

Conditional image synthesis aims to create an image according to some multi-modal guidance in the forms of textual descriptions, reference images, and image blocks to preserve, as well as their combinations.

Image Generation

Paper
Add Code

CogView: Mastering Text-to-Image Generation via Transformers

4 code implementations • NeurIPS 2021 • Ming Ding, Zhuoyi Yang, Wenyi Hong, Wendi Zheng, Chang Zhou, Da Yin, Junyang Lin, Xu Zou, Zhou Shao, Hongxia Yang, Jie Tang

Text-to-Image generation in the general domain has long been an open problem, which requires both a powerful generative model and cross-modal understanding.

Ranked #56 on Text-to-Image Generation on MS COCO (using extra training data)

Super-Resolution Zero-Shot Text-to-Image Generation

3,929

Paper
Code

UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis

no code implementations • NeurIPS 2021 • Zhu Zhang, Jianxin Ma, Chang Zhou, Rui Men, Zhikang Li, Ming Ding, Jie Tang, Jingren Zhou, Hongxia Yang

Image Generation

Paper
Add Code

Anchor-based Plain Net for Mobile Image Super-Resolution

3 code implementations • 20 May 2021 • Zongcai Du, Jie Liu, Jie Tang, Gangshan Wu

Along with the rapid development of real-world applications, higher requirements on the accuracy and efficiency of image super-resolution (SR) are brought forward.

Image Super-Resolution Quantization

275

Paper
Code

FastMoE: A Fast Mixture-of-Expert Training System

3 code implementations • 24 Mar 2021 • Jiaao He, Jiezhong Qiu, Aohan Zeng, Zhilin Yang, Jidong Zhai, Jie Tang

However, training trillion-scale MoE requires algorithm and system co-design for a well-tuned high performance distributed training system.

Language Modelling

1,379

Paper
Code

Controllable Generation from Pre-trained Language Models via Inverse Prompting

1 code implementation • 19 Mar 2021 • Xu Zou, Da Yin, Qingyang Zhong, Ming Ding, Hongxia Yang, Zhilin Yang, Jie Tang

To tackle this challenge, we propose an innovative method, inverse prompting, to better control text generation.

Language Modelling Long Form Question Answering +1

120

Paper
Code

GLM: General Language Model Pretraining with Autoregressive Blank Infilling

9 code implementations • ACL 2022 • Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang

On a wide range of tasks across NLU, conditional and unconditional generation, GLM outperforms BERT, T5, and GPT given the same model sizes and data, and achieves the best performance from a single pretrained model with 1. 25x parameters of BERT Large , demonstrating its generalizability to different downstream tasks.

Ranked #4 on Language Modelling on WikiText-103 (using extra training data)

Abstractive Text Summarization Classification +4

39,246

Paper
Code

GPT Understands, Too

7 code implementations • 18 Mar 2021 • Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, Jie Tang

Prompting a pretrained language model with natural language patterns has been proved effective for natural language understanding (NLU).

Knowledge Probing Language Modelling +2

11,406

Paper
Code

Understanding WeChat User Preferences and "Wow" Diffusion

1 code implementation • 4 Mar 2021 • Fanjin Zhang, Jie Tang, Xueyi Liu, Zhenyu Hou, Yuxiao Dong, Jing Zhang, Xiao Liu, Ruobing Xie, Kai Zhuang, Xu Zhang, Leyu Lin, Philip S. Yu

"Top Stories" is a novel friend-enhanced recommendation engine in WeChat, in which users can read articles based on preferences of both their own and their friends.

Graph Representation Learning Social and Information Networks

Paper
Code

OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services

1 code implementation • 3 Mar 2021 • Xiao Liu, Da Yin, Jingnan Zheng, Xingjian Zhang, Peng Zhang, Hongxia Yang, Yuxiao Dong, Jie Tang

Academic knowledge services have substantially facilitated the development of the science enterprise by providing a plenitude of efficient research tools.

Language Modelling Link Prediction

Paper
Code

M6: A Chinese Multimodal Pretrainer

no code implementations • 1 Mar 2021 • Junyang Lin, Rui Men, An Yang, Chang Zhou, Ming Ding, Yichang Zhang, Peng Wang, Ang Wang, Le Jiang, Xianyan Jia, Jie Zhang, Jianwei Zhang, Xu Zou, Zhikang Li, Xiaodong Deng, Jie Liu, Jinbao Xue, Huiling Zhou, Jianxin Ma, Jin Yu, Yong Li, Wei Lin, Jingren Zhou, Jie Tang, Hongxia Yang

In this work, we construct the largest dataset for multimodal pretraining in Chinese, which consists of over 1. 9TB images and 292GB texts that cover a wide range of domains.

Image Generation

Paper
Add Code

CogDL: A Comprehensive Library for Graph Deep Learning

1 code implementation • 1 Mar 2021 • Yukuo Cen, Zhenyu Hou, Yan Wang, Qibin Chen, Yizhen Luo, Zhongming Yu, Hengrui Zhang, Xingcheng Yao, Aohan Zeng, Shiguang Guo, Yuxiao Dong, Yang Yang, Peng Zhang, Guohao Dai, Yu Wang, Chang Zhou, Hongxia Yang, Jie Tang

In CogDL, we propose a unified design for the training and evaluation of GNN models for various graph tasks, making it unique among existing graph learning libraries.

Graph Classification Graph Embedding +5

1,679

Paper
Code

Generalizing Graph Convolutional Networks

1 code implementation • 1 Jan 2021 • Jialin Zhao, Yuxiao Dong, Jie Tang, Ming Ding, Kuansan Wang

Graph convolutional networks (GCNs) have emerged as a powerful framework for mining and learning with graphs.

Paper
Code

Local Clustering Graph Neural Networks

no code implementations • 1 Jan 2021 • Jiezhong Qiu, Yukuo Cen, Qibin Chen, Chang Zhou, Jingren Zhou, Hongxia Yang, Jie Tang

Based on the theoretical analysis, we propose Local Clustering Graph Neural Networks (LCGNN), a GNN learning paradigm that utilizes local clustering to efficiently search for small but compact subgraphs for GNN training and inference.

Clustering

Paper
Add Code

CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking

2 code implementations • 14 Dec 2020 • Bo Chen, Jing Zhang, Xiaokang Zhang, Xiaobin Tang, Lingfan Cai, Hong Chen, Cuiping Li, Peng Zhang, Jie Tang

In this paper, we propose CODE, which first pre-trains an expert linking model by contrastive learning on AMiner such that it can capture the representation and matching patterns of experts without supervised signals, then it is fine-tuned between AMiner and external sources to enhance the models transferability in an adversarial manner.

Active Learning Contrastive Learning +2

Paper
Code

Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines

no code implementations • 2 Dec 2020 • Yiming Gan, Yu Bo, Boyuan Tian, Leimeng Xu, Wei Hu, Shaoshan Liu, Qiang Liu, Yanjun Zhang, Jie Tang, Yuhao Zhu

We develop and commercialize autonomous machines, such as logistic robots and self-driving cars, around the globe.

Self-Driving Cars Hardware Architecture

Paper
Add Code

CogLTX: Applying BERT to Long Texts

1 code implementation • NeurIPS 2020 • Ming Ding, Chang Zhou, Hongxia Yang, Jie Tang

BERTs are incapable of processing long texts due to its quadratically increasing memory and time consumption.

text-classification Text Classification

261

Paper
Code

ExpanRL: Hierarchical Reinforcement Learning for Course Concept Expansion in MOOCs

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Jifan Yu, Chenyu Wang, Gan Luo, Lei Hou, Juanzi Li, Jie Tang, Minlie Huang, Zhiyuan Liu

Within the prosperity of Massive Open Online Courses (MOOCs), the education applications that automatically provide extracurricular knowledge for MOOC users become rising research topics.

Hierarchical Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Graph Random Neural Networks for Semi-Supervised Learning on Graphs

1 code implementation • NeurIPS 2020 • Wenzheng Feng, Jie Zhang, Yuxiao Dong, Yu Han, Huanbo Luan, Qian Xu, Qiang Yang, Evgeny Kharlamov, Jie Tang

We study the problem of semi-supervised learning on graphs, for which graph neural networks (GNNs) have been extensively explored.

Data Augmentation Node Classification

202

Paper
Code

CPM: A Large-scale Generative Chinese Pre-trained Language Model

6 code implementations • 1 Dec 2020 • Zhengyan Zhang, Xu Han, Hao Zhou, Pei Ke, Yuxian Gu, Deming Ye, Yujia Qin, Yusheng Su, Haozhe Ji, Jian Guan, Fanchao Qi, Xiaozhi Wang, Yanan Zheng, Guoyang Zeng, Huanqi Cao, Shengqi Chen, Daixuan Li, Zhenbo Sun, Zhiyuan Liu, Minlie Huang, Wentao Han, Jie Tang, Juanzi Li, Xiaoyan Zhu, Maosong Sun

However, applying GPT-3 to address Chinese NLP tasks is still challenging, as the training corpus of GPT-3 is primarily English, and the parameters are not publicly available.

Cloze Test Language Modelling +1

1,589

Paper
Code

Residual Feature Distillation Network for Lightweight Image Super-Resolution

2 code implementations • 24 Sep 2020 • Jie Liu, Jie Tang, Gangshan Wu

Thanks to FDC, we can rethink the information multi-distillation network (IMDN) and propose a lightweight and accurate SISR model called residual feature distillation network (RFDN).

Image Super-Resolution

344

Paper
Code

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

3 code implementations • 15 Sep 2020 • Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, Jiji C. V, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Jiangtao Zhang, Xiaotong Luo, Liang Chen, Yanyun Qu, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni

This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results.

Image Super-Resolution

2,712

Paper
Code

A Survey of FPGA-Based Robotic Computing

no code implementations • 13 Sep 2020 • Zishen Wan, Bo Yu, Thomas Yuang Li, Jie Tang, Yuhao Zhu, Yu Wang, Arijit Raychowdhury, Shaoshan Liu

On the other hand, FPGA-based robotic accelerators are becoming increasingly competitive alternatives, especially in latency-critical and power-limited scenarios.

Autonomous Vehicles

Paper
Add Code

A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices

no code implementations • NeurIPS 2020 • Jiezhong Qiu, Chi Wang, Ben Liao, Richard Peng, Jie Tang

Our result gives the first bound on the convergence rate of the co-occurrence matrix and the first sample complexity analysis in graph representation learning.

Graph Learning Graph Representation Learning

Paper
Add Code

MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs

no code implementations • ACL 2020 • Jifan Yu, Gan Luo, Tong Xiao, Qingyang Zhong, Yuquan Wang, Wenzheng Feng, Junyi Luo, Chenyu Wang, Lei Hou, Juanzi Li, Zhiyuan Liu, Jie Tang

The prosperity of Massive Open Online Courses (MOOCs) provides fodder for many NLP and AI research for education applications, e. g., course concept extraction, prerequisite relation discovery, etc.

Paper
Add Code

Attentional Graph Convolutional Networks for Knowledge Concept Recommendation in MOOCs in a Heterogeneous View

2 code implementations • 23 Jun 2020 • Shen Wang, Jibing Gong, Jinlong Wang, Wenzheng Feng, Hao Peng, Jie Tang, Philip S. Yu

To address this issue, we leverage both content information and context information to learn the representation of entities via graph convolution network.

Representation Learning

Paper
Code

Spectral-Energy Efficiency Trade-off-based Beamforming Design for MISO Non-Orthogonal Multiple Access Systems

no code implementations • 19 Jun 2020 • Haitham Al-Obiedollah, Kanapathippillai Cumanan, Jeyarajan Thiyagalingam, Jie Tang, Alister G. Burr, Zhiguo Ding, Octavia A. Dobre

In particular, we formulate a joint SE-EE based design as a multi-objective optimization (MOO) problem to achieve a good tradeoff between the two performance metrics.

Paper
Add Code

GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training

4 code implementations • 17 Jun 2020 • Jiezhong Qiu, Qibin Chen, Yuxiao Dong, Jing Zhang, Hongxia Yang, Ming Ding, Kuansan Wang, Jie Tang

Graph representation learning has emerged as a powerful technique for addressing real-world problems.

Contrastive Learning General Classification +5

318

Paper
Code

Self-supervised Learning: Generative or Contrastive

no code implementations • 15 Jun 2020 • Xiao Liu, Fanjin Zhang, Zhenyu Hou, Zhaoyu Wang, Li Mian, Jing Zhang, Jie Tang

As an alternative, self-supervised learning attracts many researchers for its soaring performance on representation learning in the last several years.

Graph Learning Representation Learning +1

Paper
Add Code

Attention: to Better Stand on the Shoulders of Giants

no code implementations • 27 May 2020 • Sha Yuan, Zhou Shao, Yu Zhang, Xingxing Wei, Tong Xiao, Yifan Wang, Jie Tang

In the progress of science, the previously discovered knowledge principally inspires new scientific ideas, and citation is a reasonably good reflection of this cumulative nature of scientific research.

Paper
Add Code

Graph Random Neural Network for Semi-Supervised Learning on Graphs

7 code implementations • 22 May 2020 • Wenzheng Feng, Jie Zhang, Yuxiao Dong, Yu Han, Huanbo Luan, Qian Xu, Qiang Yang, Evgeny Kharlamov, Jie Tang

We study the problem of semi-supervised learning on graphs, for which graph neural networks (GNNs) have been extensively explored.

Ranked #2 on Node Classification on CiteSeer with Public Split: fixed 20 nodes per class

Data Augmentation Graph Learning +1

12,994

Paper
Code

Understanding Negative Sampling in Graph Representation Learning

4 code implementations • 20 May 2020 • Zhen Yang, Ming Ding, Chang Zhou, Hongxia Yang, Jingren Zhou, Jie Tang

To the best of our knowledge, we are the first to derive the theory and quantify that the negative sampling distribution should be positively but sub-linearly correlated to their positive sampling distribution.

Graph Learning Graph Representation Learning +2

111

Paper
Code

Controllable Multi-Interest Framework for Recommendation

2 code implementations • 19 May 2020 • Yukuo Cen, Jianwei Zhang, Xu Zou, Chang Zhou, Hongxia Yang, Jie Tang

Recent works usually give an overall embedding from a user's behavior sequence.

Sequential Recommendation

2,146

Paper
Code

Modelling High-Order Social Relations for Item Recommendation

no code implementations • 23 Mar 2020 • Yang Liu, Liang Chen, Xiangnan He, Jiaying Peng, Zibin Zheng, Jie Tang

The prevalence of online social network makes it compulsory to study how social relations affect user choice.

Vocal Bursts Intensity Prediction

Paper
Add Code

A multi-label classification method using a hierarchical and transparent representation for paper-reviewer recommendation

no code implementations • 19 Dec 2019 • Dong Zhang, Shu Zhao, Zhen Duan, Jie Chen, Yangping Zhang, Jie Tang

Paper-reviewer recommendation task is of significant academic importance for conference chairs and journal editors.

General Classification Multi-Label Classification

Paper
Add Code

Dota 2 with Large Scale Deep Reinforcement Learning

1 code implementation • 13 Dec 2019 • Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique Pondé de Oliveira Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang, Filip Wolski, Susan Zhang

On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game.

Dota 2 reinforcement-learning +1

399

Paper
Code

Simple and Lightweight Human Pose Estimation

1 code implementation • 23 Nov 2019 • Zhe Zhang, Jie Tang, Gangshan Wu

Specifically, our LPN-50 can achieve 68. 7 in AP score on the COCO test-dev set, with only 2. 7M parameters and 1. 0 GFLOPs, while the inference speed is 17 FPS on an Intel i7-8700K CPU machine.

Keypoint Detection Novel Concepts

Paper
Code

Blockwise Self-Attention for Long Document Understanding

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Jiezhong Qiu, Hao Ma, Omer Levy, Scott Wen-tau Yih, Sinong Wang, Jie Tang

We present BlockBERT, a lightweight and efficient BERT model for better modeling long-distance dependencies.

document understanding Language Modelling +1

Paper
Code

Diagonal Graph Convolutional Networks with Adaptive Neighborhood Aggregation

no code implementations • 25 Sep 2019 • Jie Zhang, Yuxiao Dong, Jie Tang

In this paper, we revisit the mathematical foundation of GCNs and study how to extend their representation capacity.

Graph Attention Graph Classification +1

Paper
Add Code

Dimensional Reweighting Graph Convolution Networks

no code implementations • 25 Sep 2019 • Xu Zou, Qiuye Jia, Jianwei Zhang, Chang Zhou, Zijun Yao, Hongxia Yang, Jie Tang

In this paper, we propose a method named Dimensional reweighting Graph Convolutional Networks (DrGCNs), to tackle the problem of variance between dimensional information in the node representations of GCNs.

Node Classification

Paper
Add Code

Course Concept Expansion in MOOCs with External Knowledge and Interactive Game

no code implementations • ACL 2019 • Jifan Yu, Chenyu Wang, Gan Luo, Lei Hou, Juanzi Li, Jie Tang, Zhiyuan Liu

As Massive Open Online Courses (MOOCs) become increasingly popular, it is promising to automatically provide extracurricular knowledge for MOOC users.

Paper
Add Code

Towards Knowledge-Based Recommender Dialog System

1 code implementation • IJCNLP 2019 • Qibin Chen, Junyang Lin, Yichang Zhang, Ming Ding, Yukuo Cen, Hongxia Yang, Jie Tang

In this paper, we propose a novel end-to-end framework called KBRD, which stands for Knowledge-Based Recommender Dialog System.

Ranked #5 on Text Generation on ReDial

Recommendation Systems Text Generation

134

Paper
Code

Learning Guided Convolutional Network for Depth Completion

2 code implementations • 3 Aug 2019 • Jie Tang, Fei-Peng Tian, Wei Feng, Jian Li, Ping Tan

It is thus necessary to complete the sparse LiDAR data, where a synchronized guidance RGB image is often used to facilitate this completion.

Ranked #5 on Stereo-LiDAR Fusion on KITTI Depth Completion Validation

Autonomous Driving Depth Completion +1

150

Paper
Code

Infer Implicit Contexts in Real-time Online-to-Offline Recommendation

1 code implementation • 8 Jul 2019 • Xichen Ding, Jie Tang, Tracy Liu, Cheng Xu, Yaping Zhang, Feng Shi, Qixia Jiang, Dan Shen

Understanding users' context is essential for successful recommendations, especially for Online-to-Offline (O2O) recommendation, such as Yelp, Groupon, and Koubei.

Paper
Code

Dimensional Reweighting Graph Convolutional Networks

2 code implementations • 4 Jul 2019 • Xu Zou, Qiuye Jia, Jianwei Zhang, Chang Zhou, Hongxia Yang, Jie Tang

Graph Convolution Networks (GCNs) are becoming more and more popular for learning node representations on graphs.

Node Classification

Paper
Code

NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization

1 code implementation • 26 Jun 2019 • Jiezhong Qiu, Yuxiao Dong, Hao Ma, Jian Li, Chi Wang, Kuansan Wang, Jie Tang

Previous research shows that 1) popular network embedding benchmarks, such as DeepWalk, are in essence implicitly factorizing a matrix with a closed form, and 2)the explicit factorization of such matrix generates more powerful embeddings than existing methods.

Network Embedding

129

Paper
Code

Gift Contagion in Online Groups: Evidence From Virtual Red Packets

no code implementations • 24 Jun 2019 • Yuan Yuan, Tracy Liu, Chenhao Tan, Qian Chen, Alex Pentland, Jie Tang

Using data on 36 million online red packet gifts on a large social site in East Asia, we leverage a natural experimental design to identify the social contagion of gift giving in online groups.

Experimental Design Marketing

Paper
Add Code

Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models

1 code implementation • 22 Jun 2019 • Guangyong Chen, Pengfei Chen, Chang-Yu Hsieh, Chee-Kong Lee, Benben Liao, Renjie Liao, Weiwen Liu, Jiezhong Qiu, Qiming Sun, Jie Tang, Richard Zemel, Shengyu Zhang

We introduce a new molecular dataset, named Alchemy, for developing machine learning models useful in chemistry and material science.

Benchmarking BIG-bench Machine Learning

113

Paper
Code

Cognitive Knowledge Graph Reasoning for One-shot Relational Learning

1 code implementation • 13 Jun 2019 • Zhengxiao Du, Chang Zhou, Ming Ding, Hongxia Yang, Jie Tang

Inferring new facts from existing knowledge graphs (KG) with explainable reasoning processes is a significant problem and has received much attention recently.

Knowledge Graphs Relational Reasoning +1

Paper
Code

Sequential Scenario-Specific Meta Learner for Online Recommendation

1 code implementation • 2 Jun 2019 • Zhengxiao Du, Xiaowei Wang, Hongxia Yang, Jingren Zhou, Jie Tang

Our approach is based on the insight that having a good generalization from a few examples relies on both a generic model initialization and an effective strategy for adapting this model to newly arising tasks.

Few-Shot Learning

Paper
Code

Cognitive Graph for Multi-Hop Reading Comprehension at Scale

3 code implementations • ACL 2019 • Ming Ding, Chang Zhou, Qibin Chen, Hongxia Yang, Jie Tang

We propose a new CogQA framework for multi-hop question answering in web-scale documents.

Ranked #50 on Question Answering on HotpotQA

Multi-hop Question Answering Multi-Hop Reading Comprehension +1

454

Paper
Code

Representation Learning for Attributed Multiplex Heterogeneous Network

4 code implementations • 5 May 2019 • Yukuo Cen, Xu Zou, Jianwei Zhang, Hongxia Yang, Jingren Zhou, Jie Tang

Network embedding (or graph embedding) has been widely used in many real-world applications.

Ranked #1 on Link Prediction on Alibaba

Graph Embedding Link Prediction +2

12,994

Paper
Code

Towards Knowledge-Based Personalized Product Description Generation in E-commerce

4 code implementations • 29 Mar 2019 • Qibin Chen, Junyang Lin, Yichang Zhang, Hongxia Yang, Jingren Zhou, Jie Tang

In order to make the description both informative and personalized, KOBE considers a variety of important factors during text generation, including product aspects, user categories, and knowledge base, etc.

Text Generation

236

Paper
Code

Graph Adversarial Training: Dynamically Regularizing Based on Graph Structure

1 code implementation • 20 Feb 2019 • Fuli Feng, Xiangnan He, Jie Tang, Tat-Seng Chua

Adversarial Training (AT), a dynamic regularization technique, can resist the worst-case perturbations on input features and is a promising choice to improve model robustness and generalization.

Ranked #3 on Node Classification on NELL

General Classification Node Classification

Paper
Code

Bandit Learning with Implicit Feedback

1 code implementation • NeurIPS 2018 • Yi Qi, Qingyun Wu, Hongning Wang, Jie Tang, Maosong Sun

Implicit feedback, such as user clicks, although abundant in online information service systems, does not provide substantial evidence on users' evaluation of system's output.

Bayesian Inference Thompson Sampling

Paper
Code

Modeling and Predicting Citation Count via Recurrent Neural Network with Long Short-Term Memory

no code implementations • 6 Nov 2018 • Sha Yuan, Jie Tang, Yu Zhang, Yifan Wang, Tong Xiao

The rapid evolution of scientific research has been creating a huge volume of publications every year.

Digital Libraries Physics and Society

Paper
Add Code

Modeling and Predicting Popularity Dynamics via Deep Learning Attention Mechanism

no code implementations • 6 Nov 2018 • Sha Yuan, Yu Zhang, Jie Tang, Hua-Wei Shen, Xingxing Wei

Here we propose a deep learning attention mechanism to model the process through which individual items gain their popularity.

Paper
Add Code

Fast Randomized PCA for Sparse Data

2 code implementations • 16 Oct 2018 • Xu Feng, Yuyang Xie, Mingye Song, Wenjian Yu, Jie Tang

The algorithm has similar accuracy to the basic randomized SVD (rPCA) algorithm (Halko et al., 2011), but is largely optimized for sparse data.

Dimensionality Reduction Information Retrieval +1

Paper
Code

Semi-supervised Learning on Graphs with Generative Adversarial Nets

2 code implementations • 1 Sep 2018 • Ming Ding, Jie Tang, Jie Zhang

We first provide insights on working principles of adversarial learning over graphs and then present GraphSGAN, a novel approach to semi-supervised learning on graphs.

Paper
Code

DeepInf: Social Influence Prediction with Deep Learning

1 code implementation • 15 Jul 2018 • Jiezhong Qiu, Jian Tang, Hao Ma, Yuxiao Dong, Kuansan Wang, Jie Tang

Inspired by the recent success of deep neural networks in a wide range of computing applications, we design an end-to-end framework, DeepInf, to learn users' latent feature representation for predicting social influence.

Feature Engineering Representation Learning

289

Paper
Code

Spectral Network Embedding: A Fast and Scalable Method via Sparsity

1 code implementation • 7 Jun 2018 • Jie Zhang, Yan Wang, Jie Tang, Ming Ding

In this paper, we propose a $10\times \sim 100\times$ faster network embedding method, called Progle, by elegantly utilizing the sparsity property of online networks and spectral analysis.

Link Prediction Network Embedding +1

Paper
Code

Expert Finding in Community Question Answering: A Review

no code implementations • 21 Apr 2018 • Sha Yuan, Yu Zhang, Jie Tang, Juan Bautista Cabotà

Moreover, we use innovative diagrams to clarify several important concepts of ensemble learning, and find that ensemble models with several specific single models can further boosting the performance.

Community Question Answering Ensemble Learning +2

Paper
Add Code

Teaching Autonomous Driving Using a Modular and Integrated Approach

no code implementations • 22 Feb 2018 • Jie Tang, Shaoshan Liu, Songwen Pei, Stephane Zuckerman, Chen Liu, Weisong Shi, Jean-Luc Gaudiot

Then, once the students have understood these modules, the experimental platforms for integration we have developed allow the students to fully understand how the modules interact with each other.

Autonomous Driving

Paper
Add Code

Revisiting Knowledge Base Embedding as Tensor Decomposition

no code implementations • ICLR 2018 • Jiezhong Qiu, Hao Ma, Yuxiao Dong, Kuansan Wang, Jie Tang

We study the problem of knowledge base (KB) embedding, which is usually addressed through two frameworks---neural KB embedding and tensor decomposition.

Link Prediction Tensor Decomposition

Paper
Add Code

Course Concept Extraction in MOOCs via Embedding-Based Graph Propagation

no code implementations • IJCNLP 2017 • Liangming Pan, Xiaochen Wang, Chengjiang Li, Juanzi Li, Jie Tang

Massive Open Online Courses (MOOCs), offering a new way to study online, are revolutionizing education.

Paper
Add Code

Fast Top-k Area Topics Extraction with Knowledge Base

no code implementations • 13 Oct 2017 • Fang Zhang, Xiaochen Wang, Jingfei Han, Jie Tang, Shiyin Wang, Marie-Francine Moens

We leverage a large-scale knowledge base (Wikipedia) to generate topic embeddings using neural networks and use this kind of representations to help capture the representativeness of topics for given areas.

Paper
Add Code

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

4 code implementations • 9 Oct 2017 • Jiezhong Qiu, Yuxiao Dong, Hao Ma, Jian Li, Kuansan Wang, Jie Tang

This work lays the theoretical foundation for skip-gram based network embedding methods, leading to a better understanding of latent network representation learning.

Network Embedding

2,085

Paper
Code

Prerequisite Relation Learning for Concepts in MOOCs

no code implementations • ACL 2017 • Liangming Pan, Chengjiang Li, Juanzi Li, Jie Tang

What prerequisite knowledge should students achieve a level of mastery before moving forward to learn subsequent coursewares?

Relation Representation Learning

Paper
Add Code

Learn-Memorize-Recall-Reduce A Robotic Cloud Computing Paradigm

no code implementations • 16 Apr 2017 • Shaoshan Liu, Bolin Ding, Jie Tang, Dawei Sun, Zhe Zhang, Grace Tsai, Jean-Luc Gaudiot

The rise of robotic applications has led to the generation of a huge volume of unstructured data, whereas the current cloud infrastructure was designed to process limited amounts of structured data.

Cloud Computing Memorization

Paper
Add Code

A Probabilistic Framework for Location Inference from Social Media

no code implementations • 23 Feb 2017 • Yujie Qian, Jie Tang, Zhilin Yang, Binxuan Huang, Wei Wei, Kathleen M. Carley

In this paper, we formalize the problem of inferring location from social media into a semi-supervised factor graph model (SSFGM).

Management

Paper
Add Code

Weakly Learning to Match Experts in Online Community

no code implementations • 14 Nov 2016 • Yujie Qian, Jie Tang, Kan Wu

The challenge is how to trade off the matching degree between users' expertise and the question topic, and the likelihood of positive response from the invited users.

Paper
Add Code

OpenAI Gym

45 code implementations • 5 Jun 2016 • Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, Wojciech Zaremba

OpenAI Gym is a toolkit for reinforcement learning research.

reinforcement-learning Reinforcement Learning (RL)

33,869

Paper
Code

An Empirical Study on Academic Commentary and Its Implications on Reading and Writing

no code implementations • 12 Feb 2016 • Tai Wang, Xiangen Hu, Keith Shubeck, Zhiqiang Cai, Jie Tang

The relationship between reading and writing (RRW) is one of the major themes in learning science.

Paper
Add Code

Word Embedding based Correlation Model for Question/Answer Matching

no code implementations • 15 Nov 2015 • Yikang Shen, Wenge Rong, Nan Jiang, Baolin Peng, Jie Tang, Zhang Xiong

With the development of community based question answering (Q&A) services, a large scale of Q&A archives have been accumulated and are an important information and knowledge resource on the web.

Question Answering Translation

Paper
Add Code

Name List Only? Target Entity Disambiguation in Short Texts

no code implementations • EMNLP 2015 • Yixin Cao, Juanzi Li, Xiaofei Guo, Shuanhu Bai, Heng Ji, Jie Tang

Entity Disambiguation

Paper
Add Code

Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs

no code implementations • 4 Aug 2015 • Zhilin Yang, Jie Tang, William Cohen

GenVector leverages large-scale unlabeled data with embeddings and represents data of two modalities---i. e., social network users and knowledge concepts---in a shared latent topic space.

Knowledge Graphs

Paper
Add Code

Learning Topic Hierarchies for Wikipedia Categories

no code implementations • IJCNLP 2015 • Linmei Hu, Xuzhong Wang, Mengdi Zhang, Juanzi Li, Xiao-Li Li, Chao Shao, Jie Tang, Yongbin Liu

Paper
Add Code

Panther: Fast Top-k Similarity Search in Large Networks

2 code implementations • 10 Apr 2015 • Jing Zhang, Jie Tang, Cong Ma, Hanghang Tong, Yu Jing, Juanzi Li

The algorithm is based on a novel idea of random path, and an extended method is also presented, to enhance the structural similarity when two vertices are completely disconnected.

Social and Information Networks

Paper
Code

Inferring Social Status and Rich Club Effects in Enterprise Communication Networks

no code implementations • 14 Apr 2014 • Yuxiao Dong, Jie Tang, Nitesh Chawla, Tiancheng Lou, Yang Yang, Bai Wang

Our model can predict social status of individuals with 93% accuracy.

Position

Paper
Add Code

Transfer Learning Based Cross-lingual Knowledge Extraction for Wikipedia

no code implementations • ACL 2013 • Zhigang Wang, Zhixing Li, Juanzi Li, Jie Tang, Jeff Z. Pan

Information Retrieval Question Answering +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.