Search Results for author: Wei Lin

Found 63 papers, 24 papers with code

Context-based Fast Recommendation Strategy for Long User Behavior Sequence in Meituan Waimai

no code implementations • 19 Mar 2024 • Zhichao Feng, Junjiie Xie, Kaiyuan Li, Yu Qin, Pengfei Wang, Qianzhong Li, Bin Yin, Xiang Li, Wei Lin, Shangguang Wang

We first identify contexts that share similar user preferences with the target context and then locate the corresponding PoIs based on these identified contexts.

Sequential Recommendation

Paper
Add Code

AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework

1 code implementation • 19 Mar 2024 • Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan, Jun Huang, Wei Lin

The task of financial analysis primarily encompasses two key areas: stock trend prediction and the corresponding financial question answering.

Benchmarking Question Answering +2

Paper
Code

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

1 code implementation • 18 Mar 2024 • M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Sivan Doveh, Jakub Micorek, Mateusz Kozinski, Hilde Kuhene, Horst Possegger

Prompt ensembling of Large Language Model (LLM) generated category-specific prompts has emerged as an effective method to enhance zero-shot recognition ability of Vision-Language Models (VLMs).

Language Modelling Large Language Model +1

Paper
Code

A Fixed-Point Approach to Unified Prompt-Based Counting

no code implementations • 15 Mar 2024 • Wei Lin, Antoni B. Chan

Additionally, a contrastive training scheme is implemented to mitigate dataset bias inherent in current class-agnostic counting datasets, a strategy whose effectiveness is confirmed by our ablation study.

Paper
Add Code

Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning

no code implementations • 15 Mar 2024 • Yongquan He, Xuancheng Huang, Minghao Tang, Lingxun Meng, Xiang Li, Wei Lin, Wenyuan Zhang, Yifu Gao

Recent methods try to alleviate the CF problem by modifying models or replaying data, which may only remember the surface-level pattern of instructions and get confused on held-out tasks.

Instruction Following

Paper
Add Code

Robust Unsupervised Crowd Counting and Localization with Adaptive Resolution SAM

no code implementations • 27 Feb 2024 • Jia Wan, Qiangqiang Wu, Wei Lin, Antoni B. Chan

The existing crowd counting models require extensive training data, which is time-consuming to annotate.

Crowd Counting

Paper
Add Code

Target Recognition Algorithm for Monitoring Images in Electric Power Construction Process

no code implementations • 9 Feb 2024 • Hao Song, Wei Lin, Wei Song, Man Wang

To enhance precision and comprehensiveness in identifying targets in electric power construction monitoring video, a novel target recognition algorithm utilizing infrared imaging is explored.

Paper
Add Code

Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning

1 code implementation • 4 Feb 2024 • Yi Cheng, Renjun Hu, Haochao Ying, Xing Shi, Jian Wu, Wei Lin

Our extensive experiments on real-world data also validate the consistent effectiveness, efficiency, and rationale of AMFormer, suggesting it has established a strong inductive bias for deep learning on tabular data.

Inductive Bias

Paper
Code

Quantifying energy landscape of oscillatory systems: Explosion, pre-solution, and diffusion decomposition

no code implementations • 13 Jan 2024 • Shirui Bian, Ruisong Zhou, Wei Lin, Chunhe Li

Although the weighted summation of the Gaussian approximation (WSGA) approach has been proposed for quantifying the energy landscape in multistable systems by solving the diffusion equation approximately from moment equations, we are still lacking an accurate approach for quantifying the energy landscape of the periodic oscillatory systems.

Paper
Add Code

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

no code implementations • 11 Dec 2023 • Xu Peng, Junwei Zhu, Boyuan Jiang, Ying Tai, Donghao Luo, Jiangning Zhang, Wei Lin, Taisong Jin, Chengjie Wang, Rongrong Ji

Moreover, these methods often grapple with identity distortion and limited expression diversity.

Face Recognition Text-to-Image Generation

Paper
Add Code

ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models

1 code implementation • 13 Oct 2023 • Haoran Luo, Haihong E, Zichen Tang, Shiyao Peng, Yikai Guo, Wentai Zhang, Chenghao Ma, Guanting Dong, Meina Song, Wei Lin

Knowledge Base Question Answering (KBQA) aims to derive answers to natural language questions over large-scale knowledge bases (KBs), which are generally divided into two research components: knowledge retrieval and semantic parsing.

Ranked #1 on Knowledge Base Question Answering on WebQuestionsSP

Knowledge Base Question Answering Knowledge Graphs +2

190

Paper
Code

Text2NKG: Fine-Grained N-ary Relation Extraction for N-ary relational Knowledge Graph Construction

1 code implementation • 8 Oct 2023 • Haoran Luo, Haihong E, Yuhao Yang, Tianyu Yao, Yikai Guo, Zichen Tang, Wentai Zhang, Kaiyang Wan, Shiyao Peng, Meina Song, Wei Lin

To address these restrictions, we propose Text2NKG, a novel fine-grained n-ary relation extraction framework for n-ary relational knowledge graph construction.

Ranked #1 on Hypergraph-based N-ary Relaiton Extraction on HyperRED

Event-based N-ary Relaiton Extraction Hypergraph-based N-ary Relaiton Extraction +3

Paper
Code

Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)

no code implementations • 24 Sep 2023 • Guo-qing Jiang, Jinlong Liu, Zixiang Ding, Lin Guo, Wei Lin

As models for nature language processing (NLP), computer vision (CV) and recommendation systems (RS) require surging computation, a large number of GPUs/TPUs are paralleled as a large batch (LB) to improve training throughput.

Recommendation Systems

Paper
Add Code

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

1 code implementation • 19 Sep 2023 • Haojun Xia, Zhen Zheng, Yuchao Li, Donglin Zhuang, Zhongzhu Zhou, Xiafei Qiu, Yong Li, Wei Lin, Shuaiwen Leon Song

Therefore, we propose Flash-LLM for enabling low-cost and highly-efficient large generative model inference with the sophisticated support of unstructured sparsity on high-performance but highly restrictive Tensor Cores.

142

Paper
Code

CARE: Large Precision Matrix Estimation for Compositional Data

no code implementations • 13 Sep 2023 • Shucong Zhang, Huiyuan Wang, Wei Lin

High-dimensional compositional data are prevalent in many applications.

Paper
Add Code

TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification

1 code implementation • 13 Sep 2023 • M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Rogerio Feris, Horst Bischof

Vision and Language Models (VLMs), such as CLIP, have enabled visual recognition of a potentially unlimited set of categories described by text prompts.

Zero-Shot Learning

Paper
Code

Accurate Prediction of Antibody Function and Structure Using Bio-Inspired Antibody Language Model

1 code implementation • 31 Aug 2023 • Hongtai Jing, Zhengtao Gao, Sheng Xu, Tao Shen, Zhangzhi Peng, Shwai He, Tao You, Shuang Ye, Wei Lin, Siqi Sun

Remarkably, BALMFold outperforms those well-established methods like AlphaFold2, IgFold, ESMFold, and OmegaFold in the antibody benchmark, demonstrating significant potential to advance innovative engineering and streamline therapeutic antibody development by reducing the need for unnecessary trials.

Language Modelling

Paper
Code

Heterogeneous Knowledge Fusion: A Novel Approach for Personalized Recommendation via LLM

no code implementations • 7 Aug 2023 • Bin Yin, Junjie Xie, Yu Qin, Zixiang Ding, Zhichao Feng, Xiang Li, Wei Lin

The analysis and mining of user heterogeneous behavior are of paramount importance in recommendation systems.

Language Modelling Large Language Model +1

Paper
Add Code

Modeling Dual Period-Varying Preferences for Takeaway Recommendation

1 code implementation • 7 Jun 2023 • Yuting Zhang, Yiqing Wu, Ran Le, Yongchun Zhu, Fuzhen Zhuang, Ruidong Han, Xiang Li, Wei Lin, Zhulin An, Yongjun Xu

Different from traditional recommendation, takeaway recommendation faces two main challenges: (1) Dual Interaction-Aware Preference Modeling.

Recommendation Systems

Paper
Code

Sit Back and Relax: Learning to Drive Incrementally in All Weather Conditions

1 code implementation • 30 May 2023 • Stefan Leitner, M. Jehanzeb Mirza, Wei Lin, Jakub Micorek, Marc Masana, Mateusz Kozinski, Horst Possegger, Horst Bischof

We propose to store these affine parameters as a memory bank for each weather condition and plug-in their weather-specific parameters during driving (i. e. test time) when the respective weather conditions are encountered.

Autonomous Driving Incremental Learning +2

Paper
Code

HAHE: Hierarchical Attention for Hyper-Relational Knowledge Graphs in Global and Local Level

1 code implementation • ACL 2023 • Haoran Luo, Haihong E, Yuhao Yang, Yikai Guo, Mingzhi Sun, Tianyu Yao, Zichen Tang, Kaiyang Wan, Meina Song, Wei Lin

The global-level attention can model the graphical structure of HKG using hypergraph dual-attention layers, while the local-level attention can learn the sequential structure inside H-Facts via heterogeneous self-attention layers.

Ranked #1 on Link Prediction on Wikipeople

Attribute Knowledge Graphs +1

Paper
Code

Dual Intent Enhanced Graph Neural Network for Session-based New Item Recommendation

1 code implementation • 10 May 2023 • Di Jin, Luzhi Wang, Yizhen Zheng, Guojie Song, Fei Jiang, Xiang Li, Wei Lin, Shirui Pan

We design a dual-intent network to learn user intent from an attention mechanism and the distribution of historical data respectively, which can simulate users' decision-making process in interacting with a new item.

Decision Making Session-Based Recommendations +1

Paper
Code

Neural Delay Differential Equations: System Reconstruction and Image Classification

no code implementations • 11 Apr 2023 • Qunxi Zhu, Yao Guo, Wei Lin

Neural Ordinary Differential Equations (NODEs), a framework of continuous-depth neural networks, have been widely applied, showing exceptional efficacy in coping with representative datasets.

Classification Image Classification

Paper
Add Code

AIR-DA: Adversarial Image Reconstruction for Unsupervised Domain Adaptive Object Detection

no code implementations • 27 Mar 2023 • Kunyang Sun, Wei Lin, Haoqin Shi, Zhengming Zhang, Yongming Huang, Horst Bischof

This results in an imbalance of the adversarial training between the domain discriminator and the feature extractor.

Image Reconstruction object-detection +1

Paper
Add Code

Embedding Theory of Reservoir Computing and Reducing Reservoir Network Using Time Delays

no code implementations • 16 Mar 2023 • Xing-Yue Duan, Xiong Ying, Si-Yang Leng, Jürgen Kurths, Wei Lin, Huan-Fei Ma

Reservoir computing (RC), a particular form of recurrent neural network, is under explosive development due to its exceptional efficacy and high performance in reconstruction or/and prediction of complex physical systems.

Paper
Add Code

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge

1 code implementation • ICCV 2023 • Wei Lin, Leonid Karlinsky, Nina Shvetsova, Horst Possegger, Mateusz Kozinski, Rameswar Panda, Rogerio Feris, Hilde Kuehne, Horst Bischof

We adapt a VL model for zero-shot and few-shot action recognition using a collection of unlabeled videos and an unpaired action dictionary.

Ranked #3 on Zero-Shot Action Recognition on Kinetics

Few-Shot action recognition Few Shot Action Recognition +5

Paper
Code

TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and Clustering

no code implementations • 9 Mar 2023 • Wei Lin, Anna Kukleva, Horst Possegger, Hilde Kuehne, Horst Bischof

Temporal action segmentation in untrimmed videos has gained increased attention recently.

Action Segmentation Clustering +1

Paper
Add Code

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform

no code implementations • 16 Feb 2023 • Shiwei Zhang, Lansong Diao, Siyu Wang, Zongyan Cao, Yiliang Gu, Chang Si, Ziji Shi, Zhen Zheng, Chuan Wu, Wei Lin

We present Rhino, a system for accelerating tensor programs with automatic parallelization on AI platform for real production environment.

Paper
Add Code

Expediting Distributed DNN Training with Device Topology-Aware Graph Deployment

no code implementations • 13 Feb 2023 • Shiwei Zhang, Xiaodong Yi, Lansong Diao, Chuan Wu, Siyu Wang, Wei Lin

This paper presents TAG, an automatic system to derive optimized DNN training graph and its deployment onto any device topology, for expedited training in device- and topology- heterogeneous ML clusters.

Combinatorial Optimization TAG

Paper
Add Code

TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation

no code implementations • 1 Feb 2023 • Ziji Shi, Le Jiang, Ang Wang, Jie Zhang, Xianyan Jia, Yong Li, Chencan Wu, Jialin Li, Wei Lin

However, finding a suitable model parallel schedule for an arbitrary neural network is a non-trivial task due to the exploding search space.

Paper
Add Code

Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting

1 code implementation • CVPR 2023 • Wei Lin, Antoni B. Chan

In this paper, we propose the optimal transport minimization (OT-M) algorithm for crowd localization with density maps.

Crowd Counting

Paper
Code

Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting

1 code implementation • Conference 2022 • Wei Lin, Kunlin Yang, Xinzhu Ma, Junyu Gao, Lingbo Liu, Shinan Liu, Jun Hou, Shuai Yi, Antoni B. Chan

Here we propose a scale-sensitive generalized loss to tackle this problem.

Ranked #6 on Object Counting on FSC147

Object Counting

Paper
Code

SKDBERT: Compressing BERT via Stochastic Knowledge Distillation

no code implementations • 26 Nov 2022 • Zixiang Ding, Guoqing Jiang, Shuai Zhang, Lin Guo, Wei Lin

In this paper, we propose Stochastic Knowledge Distillation (SKD) to obtain compact BERT-style language model dubbed SKDBERT.

Knowledge Distillation Language Modelling

Paper
Add Code

Video Test-Time Adaptation for Action Recognition

1 code implementation • CVPR 2023 • Wei Lin, Muhammad Jehanzeb Mirza, Mateusz Kozinski, Horst Possegger, Hilde Kuehne, Horst Bischof

Our proposed method demonstrates a substantial performance gain over existing test-time adaptation approaches in both evaluations of a single distribution shift and the challenging case of random distribution shifts.

Action Recognition Temporal Action Localization +1

Paper
Code

ActMAD: Activation Matching to Align Distributions for Test-Time-Training

1 code implementation • CVPR 2023 • Muhammad Jehanzeb Mirza, Pol Jané Soneira, Wei Lin, Mateusz Kozinski, Horst Possegger, Horst Bischof

Test-Time-Training (TTT) is an approach to cope with out-of-distribution (OOD) data by adapting a trained model to distribution shifts occurring at test-time.

Image Classification

Paper
Code

MATE: Masked Autoencoders are Online 3D Test-Time Learners

1 code implementation • ICCV 2023 • M. Jehanzeb Mirza, Inkyu Shin, Wei Lin, Andreas Schriebl, Kunyang Sun, Jaesung Choe, Horst Possegger, Mateusz Kozinski, In So Kweon, Kun-Jin Yoon, Horst Bischof

Our MATE is the first Test-Time-Training (TTT) method designed for 3D data, which makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data.

3D Object Classification Point Cloud Classification

Paper
Code

Multi-Frequency-Aware Patch Adversarial Learning for Neural Point Cloud Rendering

no code implementations • 7 Oct 2022 • Jay Karhade, Haiyue Zhu, Ka-Shing Chung, Rajesh Tripathy, Wei Lin, Marcelo H. Ang Jr

The proposed approach aims to improve the rendering realness by minimizing the spectrum discrepancy between real and synthesized images, especially on the high-frequency localized sharpness information which causes image blur visually.

Paper
Add Code

Heterogeneous Federated Learning on a Graph

no code implementations • 19 Sep 2022 • Huiyuan Wang, Xuyang Zhao, Wei Lin

In this work, we consider parameter estimation in federated learning with data distribution and communication heterogeneity, as well as limited computational capacity of local devices.

Federated Learning

Paper
Add Code

Neural Stochastic Control

1 code implementation • 15 Sep 2022 • Jingdong Zhang, Qunxi Zhu, Wei Lin

These two stochastic controllers thus are complementary in applications.

Paper
Code

RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

no code implementations • 28 Jun 2022 • Di Jin, Rui Wang, Meng Ge, Dongxiao He, Xiang Li, Wei Lin, Weixiong Zhang

Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes tend to be adjacent.

Representation Learning

Paper
Add Code

CGMN: A Contrastive Graph Matching Network for Self-Supervised Graph Similarity Learning

1 code implementation • 30 May 2022 • Di Jin, Luzhi Wang, Yizhen Zheng, Xiang Li, Fei Jiang, Wei Lin, Shirui Pan

As most of the existing graph neural networks yield effective graph representations of a single graph, little effort has been made for jointly learning two graph representations and calculating their similarity score.

Collaborative Filtering Graph Classification +4

Paper
Code

Cross-View Cross-Scene Multi-View Crowd Counting

no code implementations • CVPR 2021 • Qi Zhang, Wei Lin, Antoni B. Chan

Multi-view crowd counting has been previously proposed to utilize multi-cameras to extend the field-of-view of a single camera, capturing more people in the scene, and improve counting performance for occluded people or those in low resolution.

Camera Calibration Crowd Counting

Paper
Add Code

EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language Processing

1 code implementation • 30 Apr 2022 • Chengyu Wang, Minghui Qiu, Chen Shi, Taolin Zhang, Tingting Liu, Lei LI, Jianing Wang, Ming Wang, Jun Huang, Wei Lin

The success of Pre-Trained Models (PTMs) has reshaped the development of Natural Language Processing (NLP).

Few-Shot Learning Knowledge Distillation

1,950

Paper
Code

PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems

1 code implementation • 11 Apr 2022 • Yuanxing Zhang, Langshi Chen, Siran Yang, Man Yuan, Huimin Yi, Jie Zhang, Jiamang Wang, Jianbo Dong, Yunlong Xu, Yue Song, Yong Li, Di Zhang, Wei Lin, Lin Qu, Bo Zheng

However, we observe that GPU devices in training recommender systems are underutilized, and they cannot attain an expected throughput improvement as what it has achieved in CV and NLP areas.

Marketing Recommendation Systems

149

Paper
Code

CycDA: Unsupervised Cycle Domain Adaptation from Image to Video

1 code implementation • 30 Mar 2022 • Wei Lin, Anna Kukleva, Kunyang Sun, Horst Possegger, Hilde Kuehne, Horst Bischof

To address these challenges, we propose Cycle Domain Adaptation (CycDA), a cycle-based approach for unsupervised image-to-video domain adaptation by leveraging the joint spatial information in images and videos on the one hand and, on the other hand, training an independent spatio-temporal model to bridge the modality gap.

Action Recognition Domain Adaptation +1

Paper
Code

AC-Feasible Power Transfer Regions of Virtual Power Plants: Characterization and Application

no code implementations • 9 Feb 2022 • Wei Lin, Changhong Zhao

Distributed energy resources (DERs) in distribution networks can be aggregated as a virtual power plant (VPP) for transmission-level operations.

Paper
Add Code

Neural Piecewise-Constant Delay Differential Equations

no code implementations • 4 Jan 2022 • Qunxi Zhu, Yifei Shen, Dongsheng Li, Wei Lin

Continuous-depth neural networks, such as the Neural Ordinary Differential Equations (ODEs), have aroused a great deal of interest from the communities of machine learning and data science in recent years, which bridge the connection between deep neural networks and dynamical systems.

Paper
Add Code

Tie-line Security Regions in High Dimension for Renewable Accommodations

no code implementations • 4 Jan 2022 • Wei Lin, Hua Jiang, Zhifang Yang

However, a tie-line security region is a high-dimension polytope due to multiple time periods and border buses inherently in power system operations, leading to the considerable computational burden.

Vocal Bursts Intensity Prediction

Paper
Add Code

Cost Functions over Feasible Power Transfer Regions of Virtual Power Plants

no code implementations • 2 Dec 2021 • Wei Lin, Changhong Zhao

To address this challenge, a characterization method is presented in this paper for the intraday operation of a VPP based on the concepts of nonanticipativity and robustness to DERs' uncertainties.

Paper
Add Code

M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining

no code implementations • 8 Oct 2021 • Junyang Lin, An Yang, Jinze Bai, Chang Zhou, Le Jiang, Xianyan Jia, Ang Wang, Jie Zhang, Yong Li, Wei Lin, Jingren Zhou, Hongxia Yang

Recent expeditious developments in deep learning algorithms, distributed training, and even hardware design for large models have enabled training extreme-scale models, say GPT-3 and Switch Transformer possessing hundreds of billions or even trillions of parameters.

Paper
Add Code

Learning Effective and Efficient Embedding via an Adaptively-Masked Twins-based Layer

no code implementations • 24 Aug 2021 • Bencheng Yan, Pengjie Wang, Kai Zhang, Wei Lin, Kuang-Chih Lee, Jian Xu, Bo Zheng

Each feature value is mapped to an embedding vector via an embedding learning process.

Neural Architecture Search

Paper
Add Code

Binary Code based Hash Embedding for Web-scale Applications

no code implementations • 24 Aug 2021 • Bencheng Yan, Pengjie Wang, Jinquan Liu, Wei Lin, Kuang-Chih Lee, Jian Xu, Bo Zheng

In these applications, embedding learning of categorical features is crucial to the success of deep learning models.

Recommendation Systems

Paper
Add Code

Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data

no code implementations • 16 Jul 2021 • Jiandong Mu, Mengdi Wang, Feiwen Zhu, Jun Yang, Wei Lin, Wei zhang

Reinforcement learning (RL)-based auto-pruning has been further proposed to automate the DNN pruning process to avoid expensive hand-crafted work.

Neural Network Compression reinforcement-learning +2

Paper
Add Code

Nonasymptotic theory for two-layer neural networks: Beyond the bias-variance trade-off

no code implementations • 9 Jun 2021 • Huiyuan Wang, Wei Lin

Large neural networks have proved remarkably effective in modern deep learning practice, even in the overparametrized regime where the number of active parameters is large relative to the sample size.

Vocal Bursts Valence Prediction

Paper
Add Code

M6-T: Exploring Sparse Expert Models and Beyond

no code implementations • 31 May 2021 • An Yang, Junyang Lin, Rui Men, Chang Zhou, Le Jiang, Xianyan Jia, Ang Wang, Jie Zhang, Jiamang Wang, Yong Li, Di Zhang, Wei Lin, Lin Qu, Jingren Zhou, Hongxia Yang

Mixture-of-Experts (MoE) models can achieve promising results with outrageous large amount of parameters but constant computation cost, and thus it has become a trend in model scaling.

Playing the Game of 2048

Paper
Add Code

Towards a Better Tradeoff between Effectiveness and Efficiency in Pre-Ranking: A Learnable Feature Selection based Approach

no code implementations • 17 May 2021 • Xu Ma, Pengjie Wang, Hui Zhao, Shaoguo Liu, Chuhan Zhao, Wei Lin, Kuang-Chih Lee, Jian Xu, Bo Zheng

In real-world search, recommendation, and advertising systems, the multi-stage ranking architecture is commonly adopted.

feature selection Re-Ranking

Paper
Add Code

Explicit Semantic Cross Feature Learning via Pre-trained Graph Neural Networks for CTR Prediction

no code implementations • 17 May 2021 • Feng Li, Bencheng Yan, Qingqing Long, Pengjie Wang, Wei Lin, Jian Xu, Bo Zheng

Most of the existing methods adopt a DNN-based model to capture the cross features in an implicit manner.

Click-Through Rate Prediction

Paper
Add Code

Joule-Thomson expansion of the torus-like black hole

no code implementations • 4 Mar 2021 • Jing Liang, Wei Lin, Benrong Mu

Furthermore, we investigate similarities and differences between the Van der Waals fluid, the torus-like black hole and the charged AdS black holes for the expansion.

General Relativity and Quantum Cosmology

Paper
Add Code

M6: A Chinese Multimodal Pretrainer

no code implementations • 1 Mar 2021 • Junyang Lin, Rui Men, An Yang, Chang Zhou, Ming Ding, Yichang Zhang, Peng Wang, Ang Wang, Le Jiang, Xianyan Jia, Jie Zhang, Jianwei Zhang, Xu Zou, Zhikang Li, Xiaodong Deng, Jie Liu, Jinbao Xue, Huiling Zhou, Jianxin Ma, Jin Yu, Yong Li, Wei Lin, Jingren Zhou, Jie Tang, Hongxia Yang

In this work, we construct the largest dataset for multimodal pretraining in Chinese, which consists of over 1. 9TB images and 292GB texts that cover a wide range of domains.

Image Generation

Paper
Add Code

Neural Delay Differential Equations

no code implementations • ICLR 2021 • Qunxi Zhu, Yao Guo, Wei Lin

Neural Ordinary Differential Equations (NODEs), a framework of continuous-depth neural networks, have been widely applied, showing exceptional efficacy in coping with some representative datasets.

Paper
Add Code

EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications

2 code implementations • 18 Nov 2020 • Minghui Qiu, Peng Li, Chengyu Wang, Hanjie Pan, Ang Wang, Cen Chen, Xianyan Jia, Yaliang Li, Jun Huang, Deng Cai, Wei Lin

The literature has witnessed the success of leveraging Pre-trained Language Models (PLMs) and Transfer Learning (TL) algorithms to a wide range of Natural Language Processing (NLP) applications, yet it is not easy to build an easy-to-use and scalable TL toolkit for this purpose.

Compiler Optimization Conversational Question Answering +1

1,950

Paper
Code

INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices

no code implementations • 28 Oct 2020 • Yiwu Yao, Yuchao Li, Chengyu Wang, Tianhang Yu, Houjiang Chen, Xiaotang Jiang, Jun Yang, Jun Huang, Wei Lin, Hui Shu, Chengfei Lv

The intensive computation of Automatic Speech Recognition (ASR) models obstructs them from being deployed on mobile devices.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

A bi-diffusion based layer-wise sampling method for deep learning in large graphs

no code implementations • 25 Sep 2019 • Yu He, Shiyang Wen, Wenjin Wu, Yan Zhang, Siran Yang, Yuan Wei, Di Zhang, Guojie Song, Wei Lin, Liang Wang, Bo Zheng

The Graph Convolutional Network (GCN) and its variants are powerful models for graph representation learning and have recently achieved great success on many graph-based applications.

Graph Representation Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.