Search Results for author: Liang Zhang

Found 121 papers, 36 papers with code

Language Model Guided Interpretable Video Action Reasoning

no code implementations2 Apr 2024 Ning Wang, Guangming Zhu, HS Li, Liang Zhang, Syed Afaq Ali Shah, Mohammed Bennamoun

Extensive experiments on two complex video action datasets, Charades & CAD-120, validates the improved performance and interpretability of our LaIAR framework.

Action Recognition Decision Making +3

Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors

no code implementations28 Mar 2024 Binzong Geng, ZhaoXin Huan, Xiaolu Zhang, Yong He, Liang Zhang, Fajie Yuan, Jun Zhou, Linjian Mo

However, we argue that a critical obstacle remains in deploying LLMs for practical use: the efficiency of LLMs when processing long textual user behaviors.

Click-Through Rate Prediction

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

1 code implementation19 Mar 2024 Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou

In this work, we emphasize the importance of structure information in Visual Document Understanding and propose the Unified Structure Learning to boost the performance of MLLMs.

document understanding Optical Character Recognition (OCR)

Primal Methods for Variational Inequality Problems with Functional Constraints

no code implementations19 Mar 2024 Liang Zhang, Niao He, Michael Muehlebach

In this work, we propose a simple primal method, termed Constrained Gradient Method (CGM), for addressing functional constrained variational inequality problems, without necessitating any information on the optimal Lagrange multipliers.

Navigate

HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback

no code implementations13 Mar 2024 Ang Li, Qiugen Xiao, Peng Cao, Jian Tang, Yi Yuan, Zijie Zhao, Xiaoyuan Chen, Liang Zhang, Xiangyang Li, Kaitong Yang, Weidong Guo, Yukang Gan, Xu Yu, Daniell Wang, Ying Shan

Using ChatGPT as a labeler to provide feedback on open-domain prompts in RLAIF training, we observe an increase in human evaluators' preference win ratio for model responses, but a decrease in evaluators' satisfaction rate.

Language Modelling Large Language Model +2

Predicting Learning Performance with Large Language Models: A Study in Adult Literacy

no code implementations4 Mar 2024 Liang Zhang, Jionghao Lin, Conrad Borchers, John Sabatini, John Hollander, Meng Cao, Xiangen Hu

This research is motivated by the potential of LLMs to predict learning performance based on its inherent reasoning and computational capabilities.

Knowledge Tracing Reading Comprehension

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective

1 code implementation22 Feb 2024 Zihao Yue, Liang Zhang, Qin Jin

In this paper, we explore a new angle of this issue: overly detailed training data hinders the model's ability to timely terminate generation, leading to continued outputs beyond visual perception limits.

Hallucination Sentence

Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation

no code implementations16 Feb 2024 Xinjian Zhao, Liang Zhang, Yang Liu, Ruocheng Guo, Xiangyu Zhao

To address this challenge, we propose an innovative framework: Adversarial Curriculum Graph Contrastive Learning (ACGCL), which capitalizes on the merits of pair-wise augmentation to engender graph-level positive and negative samples with controllable similarity, alongside subgraph contrastive learning to discern effective graph patterns therein.

Contrastive Learning Graph Representation Learning

Large Language Model-Based Interpretable Machine Learning Control in Building Energy Systems

no code implementations14 Feb 2024 Liang Zhang, Zhelun Chen

The potential of Machine Learning Control (MLC) in HVAC systems is hindered by its opaque nature and inference mechanisms, which is challenging for users and modelers to fully comprehend, ultimately leading to a lack of trust in MLC-based decision-making.

Decision Making In-Context Learning +4

Advancing Building Energy Modeling with Large Language Models: Exploration and Case Studies

no code implementations14 Feb 2024 Liang Zhang, Zhelun Chen, Vitaly Ford

The findings advocate a multidisciplinary approach in future artificial intelligence research, with implications extending beyond building energy modeling to other specialized engineering modeling.

Language Modelling Large Language Model +1

Enhancing Large Language Model Performance To Answer Questions and Extract Information More Accurately

no code implementations27 Jan 2024 Liang Zhang, Katherine Jijo, Spurthi Setty, Eden Chung, Fatima Javid, Natan Vidra, Tommy Clifford

Large Language Models (LLMs) generate responses to questions; however, their effectiveness is often hindered by sub-optimal quality of answers and occasional failures to provide accurate responses to questions.

Language Modelling Large Language Model +1

Improving Classification Performance With Human Feedback: Label a few, we label the rest

no code implementations17 Jan 2024 Natan Vidra, Thomas Clifford, Katherine Jijo, Eden Chung, Liang Zhang

In the realm of artificial intelligence, where a vast majority of data is unstructured, obtaining substantial amounts of labeled data to train supervised machine learning models poses a significant challenge.

Active Learning text-classification +1

The two-way knowledge interaction interface between humans and neural networks

no code implementations10 Jan 2024 Zhanliang He, Nuoye Xiong, Hongsheng Li, Peiyi Shen, Guangming Zhu, Liang Zhang

Through experimental validation, based on this interaction interface, NN can provide humans with easily understandable explanations of the reasoning process.

Content-Conditioned Generation of Stylized Free hand Sketches

no code implementations9 Jan 2024 Jiajun Liu, Siyuan Wang, Guangming Zhu, Liang Zhang, Ning li, Eryang Gao

We explore the performance of the model, including using styles randomly sampled from a prior normal distribution to generate images with various free-hand sketching styles, disentangling the painters' styles from known free-hand sketches to generate images with specific styles, and generating images of unknown classes that are not in the training set.

Data Augmentation Image Generation

Image classification network enhancement methods based on knowledge injection

no code implementations9 Jan 2024 Yishuang Tian, Ning Wang, Liang Zhang

The current deep neural network algorithm still stays in the end-to-end training supervision method like Image-Label pairs, which makes traditional algorithm is difficult to explain the reason for the results, and the prediction logic is difficult to understand and analyze.

Classification Image Classification

A multimodal gesture recognition dataset for desktop human-computer interaction

no code implementations8 Jan 2024 Qi Wang, Fengchao Zhu, Guangming Zhu, Liang Zhang, Ning li, Eryang Gao

Gesture recognition is an indispensable component of natural and efficient human-computer interaction technology, particularly in desktop-level applications, where it can significantly enhance people's productivity.

Gesture Recognition

Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment

1 code implementation25 Dec 2023 Rui Zhao, Liang Zhang, Biao Fu, Cong Hu, Jinsong Su, Yidong Chen

The first KL divergence optimizes the conditional variational autoencoder and regularizes the encoder outputs, while the second KL divergence performs a self-distillation from the posterior path to the prior path, ensuring the consistency of decoder outputs.

Sign Language Translation Translation

Opportunities and Challenges of Applying Large Language Models in Building Energy Efficiency and Decarbonization Studies: An Exploratory Overview

no code implementations18 Dec 2023 Liang Zhang, Zhelun Chen

In recent years, the rapid advancement and impressive capabilities of Large Language Models (LLMs) have been evident across various domains.

Code Generation

Enhance Sketch Recognition's Explainability via Semantic Component-Level Parsing

1 code implementation13 Dec 2023 Guangming Zhu, Siyuan Wang, Tianci Wu, Liang Zhang

Humans can recognize varied sketches of a category easily by identifying the concurrence and layout of the intrinsic semantic components of the category, since humans draw free-hand sketches based a common consensus that which types of semantic components constitute each sketch category.

Sketch Recognition

Sketch Input Method Editor: A Comprehensive Dataset and Methodology for Systematic Input Recognition

1 code implementation30 Nov 2023 Guangming Zhu, Siyuan Wang, Qing Cheng, Kelong Wu, Hao Li, Liang Zhang

With the recent surge in the use of touchscreen devices, free-hand sketching has emerged as a promising modality for human-computer interaction.

Class Incremental Learning Domain Adaptation +2

Optimizing and Fine-tuning Large Language Model for Urban Renewal

no code implementations27 Nov 2023 Xi Wang, Xianyao Ling, Tom Zhang, Xuecao Li, Shaolan Wang, Zhixing Li, Liang Zhang, Peng Gong

This study demonstrates the effectiveness and superiority of the joint fine-tuning method using Prefix and LoRA for ChatGLM in the urban renewal knowledge QA tasks.

Language Modelling Large Language Model +2

CDR-Adapter: Learning Adapters to Dig Out More Transferring Ability for Cross-Domain Recommendation Models

no code implementations4 Nov 2023 Yanyu Chen, Yao Yao, Wai Kin Victor Chan, Li Xiao, Kai Zhang, Liang Zhang, Yun Ye

In this paper, we present a scalable and efficient paradigm to address data sparsity and cold-start issues in CDR, named CDR-Adapter, by decoupling the original recommendation model from the mapping function, without requiring re-engineering the network structure.

Recommendation Systems Transfer Learning

Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization

no code implementations NeurIPS 2023 Liang Zhang, Junchi Yang, Amin Karbasi, Niao He

Particularly, given the inexact initialization oracle, our regularization-based algorithms achieve the best of both worlds - optimal reproducibility and near-optimal gradient complexity - for minimization and minimax optimization.

DPZero: Private Fine-Tuning of Language Models without Backpropagation

no code implementations14 Oct 2023 Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

The widespread practice of fine-tuning large language models (LLMs) on domain-specific data faces two major challenges in memory and privacy.

Graph Edit Distance Learning via Different Attention

no code implementations26 Aug 2023 Jiaxi Lv, Liang Zhang, Yi Huang, Jiancheng Huang, Shifeng Chen

To this end, DiffAtt uses the difference between two graph-level embeddings as an attentional mechanism to capture the graph structural difference of the two graphs.

Graph Similarity

Explore and Tell: Embodied Visual Captioning in 3D Environments

no code implementations ICCV 2023 Anwen Hu, ShiZhe Chen, Liang Zhang, Qin Jin

To overcome this limitation, we propose a novel task called Embodied Captioning, which equips visual captioning models with navigation capabilities, enabling them to actively explore the scene and reduce visual ambiguity from suboptimal viewpoints.

Image Captioning Navigate +1

Movie101: A New Movie Understanding Benchmark

1 code implementation20 May 2023 Zihao Yue, Qi Zhang, Anwen Hu, Liang Zhang, Ziheng Wang, Qin Jin

Closer to real scenarios, the Movie Clip Narrating (MCN) task in our benchmark asks models to generate role-aware narration paragraphs for complete movie clips where no actors are speaking.

Video Captioning

InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation

1 code implementation10 May 2023 Anwen Hu, ShiZhe Chen, Liang Zhang, Qin Jin

Existing metrics only provide a single score to measure caption qualities, which are less explainable and informative.

Benchmarking Image Captioning

Biomarker Investigation using Multiple Brain Measures from MRI through XAI in Alzheimer's Disease Classification

no code implementations3 May 2023 Davide Coluzzi, Valentina Bordin, Massimo Walter Rivolta, Igor Fortel, Liang Zhang, Alex Leow, Giuseppe Baselli

The XAI assessment was conducted across 132 brain parcels, extracted from a combination of the Harvard-Oxford and AAL brain atlases, and compared to well-known pathological regions to measure adherence to domain knowledge.

Classification Explainable artificial intelligence +1

MPMQA: Multimodal Question Answering on Product Manuals

1 code implementation19 Apr 2023 Liang Zhang, Anwen Hu, Jing Zhang, Shuo Hu, Qin Jin

Taking into account the length of product manuals and the fact that a question is always related to a small number of pages, MPMQA can be naturally split into two subtasks: retrieving most related pages and then generating multimodal answers.

Question Answering Sentence

Road Network Representation Learning: A Dual Graph based Approach

no code implementations13 Apr 2023 Liang Zhang, Cheng Long

The constructed hypergraph would naturally capture the high-order relationships among roads with hyperedges.

Graph Reconstruction hyperedge classification +1

TemPL: A Novel Deep Learning Model for Zero-Shot Prediction of Protein Stability and Activity Based on Temperature-Guided Language Modeling

no code implementations7 Apr 2023 Pan Tan, Mingchen Li, Liang Zhang, Zhiqiang Hu, Liang Hong

We introduce TemPL, a novel deep learning approach for zero-shot prediction of protein stability and activity, harnessing temperature-guided language modeling.

Language Modelling

Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning

1 code implementation20 Mar 2023 Liang Zhang, Jianming Deng

To address these challenges, we propose: (1) a cyclical offline dataset (COD), designed based on common real-world scenarios to facilitate easy collection; (2) an offline RL model called DataLight, capable of learning satisfactory control strategies from the COD; and (3) a method called Arbitrary To Cyclical (ATC), which can transform most RL-based methods into cyclical signal control.

Offline RL Reinforcement Learning (RL)

Accommodating Audio Modality in CLIP for Multimodal Processing

1 code implementation12 Mar 2023 Ludan Ruan, Anwen Hu, Yuqing Song, Liang Zhang, Sipeng Zheng, Qin Jin

In this paper, we extend the stateof-the-art Vision-Language model CLIP to accommodate the audio modality for Vision-Language-Audio multimodal processing.

AudioCaps Contrastive Learning +4

3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud

no code implementations CVPR 2023 Mingtao Feng, Haoran Hou, Liang Zhang, Zijie Wu, Yulan Guo, Ajmal Mian

In-depth understanding of a 3D scene not only involves locating/recognizing individual objects, but also requires to infer the relationships and interactions among them.

Towards Better Document-level Relation Extraction via Iterative Inference

1 code implementation26 Nov 2022 Liang Zhang, Jinsong Su, Yidong Chen, Zhongjian Miao, Zijun Min, Qingguo Hu, Xiaodong Shi

Existing methods usually directly predict the relations of all entity pairs of input document in a one-pass manner, ignoring the fact that predictions of some entity pairs heavily depend on the predicted results of other pairs.

Contrastive Learning Document-level Relation Extraction +1

Region Embedding with Intra and Inter-View Contrastive Learning

1 code implementation15 Nov 2022 Liang Zhang, Cheng Long, Gao Cong

Motivated by the success of contrastive learning for representation learning, we propose to leverage it for multi-view region representation learning and design a model called ReMVC (Region Embedding with Multi-View Contrastive Learning) by following two guidelines: i) comparing a region with others within each view for effective representation extraction and ii) comparing a region with itself across different views for cross-view information sharing.

Clustering Contrastive Learning +1

Deep Reinforcement Learning with Vector Quantized Encoding

no code implementations12 Nov 2022 Liang Zhang, Justin Lieffers, Adarsh Pyarelal

Human decision-making often involves combining similar states into categories and reasoning at the level of the categories rather than the actual states.

Decision Making reinforcement-learning +1

Using Features at Multiple Temporal and Spatial Resolutions to Predict Human Behavior in Real Time

no code implementations12 Nov 2022 Liang Zhang, Justin Lieffers, Adarsh Pyarelal

We contend that for an artificially intelligent agent to effectively model human teammates, i. e., demonstrate computational theory of mind (ToM), it should do the same.

DynamicLight: Dynamically Tuning Traffic Signal Duration with DRL

1 code implementation2 Nov 2022 Liang Zhang, Qiang Wu, Jun Shen, Linyuan Lü, Bo Du, Akbar Telikani, Jianqing Wu, Shubin Xie

Deep reinforcement learning (DRL) is becoming increasingly popular in implementing traffic signal control (TSC).

Q-Learning

LinearCoFold and LinearCoPartition: Linear-Time Algorithms for Secondary Structure Prediction of Interacting RNA molecules

no code implementations26 Oct 2022 He Zhang, Sizhen Li, Liang Zhang, David H. Mathews, Liang Huang

Vienna RNAcofold, which reduces the problem into the classical single sequence folding by concatenating two strands, scales in cubic time against the combined sequence length, and is slow for long sequences.

MobileCodec: Neural Inter-frame Video Compression on Mobile Devices

no code implementations18 Jul 2022 Hoang Le, Liang Zhang, Amir Said, Guillaume Sautiere, Yang Yang, Pranav Shrestha, Fei Yin, Reza Pourreza, Auke Wiggers

Realizing the potential of neural video codecs on mobile devices is a big technological challenge due to the computational complexity of deep networks and the power-constrained mobile hardware.

Video Compression

Contrastive Brain Network Learning via Hierarchical Signed Graph Pooling Model

no code implementations14 Jul 2022 Haoteng Tang, Guixiang Ma, Lei Guo, Xiyao Fu, Heng Huang, Liang Zhang

Here, we propose an interpretable hierarchical signed graph representation learning model to extract graph-level representations from brain functional networks, which can be used for different prediction tasks.

Contrastive Learning Graph Learning +1

LinearAlifold: Linear-Time Consensus Structure Prediction for RNA Alignments

1 code implementation29 Jun 2022 Liang Zhang, Sizhen Li, He Zhang, David H. Mathews, Liang Huang

We present LinearAlifold, an efficient algorithm for folding aligned RNA homologs that scales linearly with both the sequence length and the number of sequences, based on our recent work LinearFold that folds a single RNA in linear time.

Bring Your Own Algorithm for Optimal Differentially Private Stochastic Minimax Optimization

no code implementations1 Jun 2022 Liang Zhang, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

We provide a general framework for solving differentially private stochastic minimax optimization (DP-SMO) problems, which enables the practitioners to bring their own base optimization algorithm and use it as a black-box to obtain the near-optimal privacy-loss trade-off.

Generalizing Multimodal Pre-training into Multilingual via Language Acquisition

no code implementations29 May 2022 Liang Zhang, Anwen Hu, Qin Jin

Specifically, we design a lightweight language acquisition encoder based on state-of-the-art monolingual VLP models.

Language Acquisition Retrieval +2

Generalization Bounds of Nonconvex-(Strongly)-Concave Stochastic Minimax Optimization

no code implementations28 May 2022 Siqi Zhang, Yifan Hu, Liang Zhang, Niao He

We further study the algorithm-dependent generalization bounds via stability arguments of algorithms.

Generalization Bounds

Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection

1 code implementation28 Apr 2022 Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian

Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.

Saliency Prediction

Landing AI on Networks: An equipment vendor viewpoint on Autonomous Driving Networks

no code implementations26 Apr 2022 Dario Rossi, Liang Zhang

The tremendous achievements of Artificial Intelligence (AI) in computer vision, natural language processing, games and robotics, has extended the reach of the AI hype to other fields: in telecommunication networks, the long term vision is to let AI fully manage, and autonomously drive, all aspects of network operation.

Autonomous Driving

A Masked Image Reconstruction Network for Document-level Relation Extraction

no code implementations21 Apr 2022 Liang Zhang, Yidong Cheng

After that, we look on the entity-pair matrix as an image and then randomly mask it and restore it through an inference module to capture the correlations between the relationships.

Document-level Relation Extraction Image Reconstruction +2

Trigger-GNN: A Trigger-Based Graph Neural Network for Nested Named Entity Recognition

no code implementations12 Apr 2022 Yuan Sui, Fanyang Bu, Yingting Hu, Wei Yan, Liang Zhang

Nested named entity recognition (NER) aims to identify the entity boundaries and recognize categories of the named entities in a complex hierarchical sentence.

named-entity-recognition Named Entity Recognition +3

A Token-level Contrastive Framework for Sign Language Translation

1 code implementation11 Apr 2022 Biao Fu, PeiGen Ye, Liang Zhang, Pei Yu, Cong Hu, Yidong Chen, Xiaodong Shi

Sign Language Translation (SLT) is a promising technology to bridge the communication gap between the deaf and the hearing people.

Contrastive Learning Machine Translation +5

DynLight: Realize dynamic phase duration with multi-level traffic signal control

no code implementations7 Apr 2022 Liang Zhang, Shubin Xie, Jianming Deng

We would like to withdraw this article for the following reasons: 1 this article is not satisfactory for limited language and theoretical description; 2 we have enriched and revised this article with the help of other authors; 3 we must update the author contribution information.

NC-DRE: Leveraging Non-entity Clue Information for Document-level Relation Extraction

no code implementations1 Apr 2022 Liang Zhang, Yidong Cheng

Document-level relation extraction (RE), which requires reasoning on multiple entities in different sentences to identify complex inter-sentence relations, is more challenging than sentence-level RE.

Document-level Relation Extraction Language Modelling +2

A Densely Connected Criss-Cross Attention Network for Document-level Relation Extraction

no code implementations26 Mar 2022 Liang Zhang, Yidong Cheng

Specifically, the Dense-CCNet performs entity-pair-level logical reasoning through the Criss-Cross Attention (CCA), which can collect contextual information in horizontal and vertical directions on the entity-pair matrix to enhance the corresponding entity-pair representation.

Document-level Relation Extraction Logical Reasoning +2

Learning From Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection

1 code implementation CVPR 2022 Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian

Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.

Saliency Prediction

Leveraging Queue Length and Attention Mechanisms for Enhanced Traffic Signal Control Optimization

2 code implementations30 Dec 2021 Liang Zhang, Shubin Xie, Jianming Deng

We propose two new methods: (1) Max Queue-Length (M-QL), an optimization-based traditional method designed based on the property of queue length; and (2) AttentionLight, an RL model that employs the self-attention mechanism to capture the signal phase correlation without requiring human knowledge of phase relationships.

Reinforcement Learning (RL)

Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control

1 code implementation19 Dec 2021 Liang Zhang, Qiang Wu, Jun Shen, Linyuan Lü, Bo Du, Jianqing Wu

Many studies confirmed that a proper traffic state representation is more important than complex algorithms for the classical traffic signal control (TSC) problem.

Reinforcement Learning (RL)

Gram-SLD: Automatic Self-labeling and Detection for Instance Objects

no code implementations7 Dec 2021 Rui Wang, Chengtun Wu, Jiawen Xin, Liang Zhang

Instance object detection plays an important role in intelligent monitoring, visual navigation, human-computer interaction, intelligent services and other fields.

Object object-detection +2

Efficient Pressure: Improving efficiency for signalized intersections

1 code implementation4 Dec 2021 Qiang Wu, Liang Zhang, Jun Shen, Linyuan Lü, Bo Du, Jianqing Wu

Since conventional approaches could not adapt to dynamic traffic conditions, reinforcement learning (RL) has attracted more attention to help solve the traffic signal control (TSC) problem.

Reinforcement Learning (RL)

Analysis and visualization of spatial transcriptomic data

no code implementations15 Oct 2021 Boxiang Liu, Yanjun Li, Liang Zhang

Human and animal tissues consist of heterogeneous cell types that organize and interact in highly structured manners.

Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition

no code implementations19 Aug 2021 Ning Wang, Guangming Zhu, Liang Zhang, Peiyi Shen, Hongsheng Li, Cong Hua

With the effective spatio-temporal relationship modeling, it is possible not only to uncover contextual information in each frame but also to directly capture inter-time dependencies.

Human-Object Interaction Detection Object

Deep Natural Language Processing for LinkedIn Search

no code implementations16 Aug 2021 Weiwei Guo, Xiaowei Liu, Sida Wang, Michaeel Kazi, Zhiwei Wang, Zhoutong Fu, Jun Jia, Liang Zhang, Huiji Gao, Bo Long

Building a successful search system requires a thorough understanding of textual data semantics, where deep learning based natural language processing techniques (deep NLP) can be of great help.

Document Ranking Language Modelling

Deep Natural Language Processing for LinkedIn Search Systems

no code implementations30 Jul 2021 Weiwei Guo, Xiaowei Liu, Sida Wang, Michaeel Kazi, Zhoutong Fu, Huiji Gao, Jun Jia, Liang Zhang, Bo Long

Many search systems work with large amounts of natural language data, e. g., search queries, user profiles and documents, where deep learning based natural language processing techniques (deep NLP) can be of great help.

A Systematic Collection of Medical Image Datasets for Deep Learning

1 code implementation24 Jun 2021 Johann Li, Guangming Zhu, Cong Hua, Mingtao Feng, BasheerBennamoun, Ping Li, Xiaoyuan Lu, Juan Song, Peiyi Shen, Xu Xu, Lin Mei, Liang Zhang, Syed Afaq Ali Shah, Mohammed Bennamoun

Thus, as comprehensive as possible, this paper provides a collection of medical image datasets with their associated challenges for deep learning research.

MeSIN: Multilevel Selective and Interactive Network for Medication Recommendation

no code implementations22 Apr 2021 Yang An, Liang Zhang, Mao You, Xueqing Tian, Bo Jin, Xiaopeng Wei

Second, we incorporate a novel interactive long-short term memory network (InLSTM) to reinforce the interactions of multilevel medical sequences in EHR data with the help of the calibrated memory-augmented cell and an enhanced input gate.

Decision Making

Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud

1 code implementation ICCV 2021 Mingtao Feng, Zhen Li, Qi Li, Liang Zhang, Xiangdong Zhang, Guangming Zhu, HUI ZHANG, Yaonan Wang, Ajmal Mian

There are three main challenges in 3D object grounding: to find the main focus in the complex and diverse description; to understand the point cloud scene; and to locate the target object.

Object

Real-Time Vanishing Point Detector Integrating Under-Parameterized RANSAC and Hough Transform

no code implementations ICCV 2021 Jianping Wu, Liang Zhang, Ye Liu, Ke Chen

We propose a novel approach that integrates under-parameterized RANSAC (UPRANSAC) with Hough Transform to detect vanishing points (VPs) from un-calibrated monocular images.

A two-component Comptonisation model for the type-B QPO in MAXI J1348-630

no code implementations18 Dec 2020 Federico García, Mariano Méndez, Konstantinos Karpouzas, Tomaso Belloni, Liang Zhang, Diego Altamirano

The data show a strong type-B QPO at ~4. 5 Hz with increasing fractional rms amplitude with energy and positive lags with respect to a reference band at 2-2. 5 keV.

High Energy Astrophysical Phenomena

More Industry-friendly: Federated Learning with High Efficient Design

no code implementations16 Dec 2020 Dingwei Li, Qinglong Chang, Lixue Pang, Yanfang Zhang, Xudong Sun, Jikun Ding, Liang Zhang

Although many achievements have been made since Google threw out the paradigm of federated learning (FL), there still exists much room for researchers to optimize its efficiency.

Federated Learning Vocal Bursts Intensity Prediction

How does Weight Correlation Affect Generalisation Ability of Deep Neural Networks?

no code implementations NeurIPS 2020 Gaojie Jin, Xinping Yi, Liang Zhang, Lijun Zhang, Sven Schewe, Xiaowei Huang

This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability.

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

no code implementations19 Nov 2020 Suihanjin Yu, Youmin Zhang, Chen Wang, Xiao Bai, Liang Zhang, Edwin R. Hancock

To address this problem, we introduce a lightweight but effective Global Matching Component (GMC) to grab global matching features.

Optical Flow Estimation

How does Weight Correlation Affect the Generalisation Ability of Deep Neural Networks

1 code implementation12 Oct 2020 Gaojie Jin, Xinping Yi, Liang Zhang, Lijun Zhang, Sven Schewe, Xiaowei Huang

This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability.

MeDaS: An open-source platform as service to help break the walls between medicine and informatics

no code implementations12 Jul 2020 Liang Zhang, Johann Li, Ping Li, Xiaoyuan Lu, Peiyi Shen, Guangming Zhu, Syed Afaq Shah, Mohammed Bennarmoun, Kun Qian, Björn W. Schuller

To the best of our knowledge, MeDaS is the first open-source platform proving a collaborative and interactive service for researchers from a medical background easily using DL related toolkits, and at the same time for scientists or engineers from information sciences to understand the medical knowledge side.

On Vocabulary Reliance in Scene Text Recognition

no code implementations CVPR 2020 Zhaoyi Wan, Jielei Zhang, Liang Zhang, Jiebo Luo, Cong Yao

This remedy alleviates the problem of vocabulary reliance and improves the overall scene text recognition performance.

Scene Text Recognition

COVID-19 Chest CT Image Segmentation -- A Deep Convolutional Neural Network Solution

no code implementations23 Apr 2020 Qingsen Yan, Bo wang, Dong Gong, Chuan Luo, Wei Zhao, Jianhu Shen, Qinfeng Shi, Shuo Jin, Liang Zhang, Zheng You

Inspired by the observation that the boundary of the infected lung can be enhanced by adjusting the global intensity, in the proposed deep CNN, we introduce a feature variation block which adaptively adjusts the global properties of the features for segmenting COVID-19 infection.

Computed Tomography (CT) Image Segmentation +3

Algorithm for Optimized mRNA Design Improves Stability and Immunogenicity

2 code implementations21 Apr 2020 He Zhang, Liang Zhang, Ang Lin, Congcong Xu, Ziyu Li, Kaibo Liu, Boxiang Liu, Xiaopin Ma, Fanfan Zhao, Weiguo Yao, Hangwen Li, David H. Mathews, Yujian Zhang, Liang Huang

Messenger RNA (mRNA) vaccines are being used for COVID-19, but still suffer from the critical issue of mRNA instability and degradation, which is a major obstacle in the storage, distribution, and efficacy of the vaccine.

Sentence

Efficient Scene Text Detection with Textual Attention Tower

no code implementations30 Jan 2020 Liang Zhang, Yufei Liu, Hang Xiao, Lu Yang, Guangming Zhu, Syed Afaq Shah, Mohammed Bennamoun, Peiyi Shen

Scene text detection has received attention for years and achieved an impressive performance across various benchmarks.

Scene Text Detection Text Detection

Structure-Feature based Graph Self-adaptive Pooling

1 code implementation30 Jan 2020 Liang Zhang, Xudong Wang, Hongsheng Li, Guangming Zhu, Peiyi Shen, Ping Li, Xiaoyuan Lu, Syed Afaq Ali Shah, Mohammed Bennamoun

To solve these problems mentioned above, we propose a novel graph self-adaptive pooling method with the following objectives: (1) to construct a reasonable pooled graph topology, structure and feature information of the graph are considered simultaneously, which provide additional veracity and objectivity in node selection; and (2) to make the pooled nodes contain sufficiently effective graph information, node feature information is aggregated before discarding the unimportant nodes; thus, the selected nodes contain information from neighbor nodes, which can enhance the use of features of the unselected nodes.

Graph Classification

Relation Graph Network for 3D Object Detection in Point Clouds

no code implementations30 Nov 2019 Mingtao Feng, Syed Zulqarnain Gilani, Yaonan Wang, Liang Zhang, Ajmal Mian

Convolutional Neural Networks (CNNs) have emerged as a powerful strategy for most object detection tasks on 2D images.

3D Object Detection Object +3

Beyond Similarity: Relation Embedding with Dual Attentions for Item-based Recommendation

no code implementations11 Nov 2019 Liang Zhang, Guannan Liu, Junjie Wu

Given the effectiveness and ease of use, Item-based Collaborative Filtering (ICF) methods have been broadly used in industry in recent years.

Collaborative Filtering Relation +2

Point Attention Network for Semantic Segmentation of 3D Point Clouds

no code implementations27 Sep 2019 Mingtao Feng, Liang Zhang, Xuefei Lin, Syed Zulqarnain Gilani, Ajmal Mian

We propose a point attention network that learns rich local shape features and their contextual correlations for 3D point cloud semantic segmentation.

Point Cloud Segmentation Semantic Segmentation

Adversarial Training Methods for Network Embedding

1 code implementation30 Aug 2019 Quanyu Dai, Xiao Shen, Liang Zhang, Qiang Li, Dan Wang

To improve this strategy, we further propose an interpretable adversarial training method by enforcing the reconstruction of the adversarial examples in the discrete graph domain.

Link Prediction Network Embedding +1

Recommendation with Attribute-aware Product Networks: A Representation Learning Model

no code implementations16 Aug 2019 Guannan Liu, Liang Zhang, Junjie Wu, Xiao Fang

Specifically, eRAN first maps items connected in attribute networks to low-dimensional embedding vectors through a deep autoencoder, and then an attention mechanism is applied to model the attractions of attributes to users, from which personalized item representation can be derived.

Attribute Recommendation Systems +1

Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction

no code implementations22 Mar 2019 Dongyang Zhao, Liang Zhang, Bo Zhang, Lizhou Zheng, Yongjun Bao, Weipeng Yan

To tackle this challenge, we propose a deep hierarchical reinforcement learning based recommendation framework, which consists of two components, i. e., high-level agent and low-level agent.

Hierarchical Reinforcement Learning Recommendation Systems +2

GCN-LASE: Towards Adequately Incorporating Link Attributes in Graph Convolutional Networks

1 code implementation26 Feb 2019 Ziyao Li, Liang Zhang, Guojie Song

Graph Convolutional Networks (GCNs) have proved to be a most powerful architecture in aggregating local neighborhood information for individual graph nodes.

Informativeness

Attention in Convolutional LSTM for Gesture Recognition

1 code implementation NeurIPS 2018 Liang Zhang, Guangming Zhu, Lin Mei, Peiyi Shen, Syed Afaq Ali Shah, Mohammed Bennamoun

On this basis, a new variant of LSTM is derived, in which the convolutional structures are only embedded into the input-to-state transition of LSTM.

Gesture Recognition

Real-time Power System State Estimation and Forecasting via Deep Neural Networks

3 code implementations15 Nov 2018 Liang Zhang, Gang Wang, Georgios B. Giannakis

To bypass these hurdles, this paper advocates deep neural networks (DNNs) for real-time power system monitoring.

Rolling Shutter Correction Time Series Analysis

SepNE: Bringing Separability to Network Embedding

no code implementations14 Nov 2018 Ziyao Li, Liang Zhang, Guojie Song

We further propose SepNE, a simple and flexible network embedding algorithm which independently learns representations for different subsets of nodes in separated processes.

Network Embedding

Deep Reinforcement Learning for Page-wise Recommendations

no code implementations7 May 2018 Xiangyu Zhao, Long Xia, Liang Zhang, Zhuoye Ding, Dawei Yin, Jiliang Tang

In particular, we propose a principled approach to jointly generate a set of complementary items and the corresponding strategy to display them in a 2-D page; and propose a novel page-wise recommendation framework based on deep reinforcement learning, DeepPage, which can optimize a page of items with proper display based on real-time feedback from users.

Recommendation Systems reinforcement-learning +1

Deep Keyframe Detection in Human Action Videos

no code implementations26 Apr 2018 Xiang Yan, Syed Zulqarnain Gilani, Hanlin Qin, Mingtao Feng, Liang Zhang, Ajmal Mian

Detecting representative frames in videos based on human actions is quite challenging because of the combined factors of human pose in action and the background.

Deep Reinforcement Learning for List-wise Recommendations

7 code implementations30 Dec 2017 Xiangyu Zhao, Liang Zhang, Long Xia, Zhuoye Ding, Dawei Yin, Jiliang Tang

Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services.

Recommendation Systems reinforcement-learning +1

Bringing Salary Transparency to the World: Computing Robust Compensation Insights via LinkedIn Salary

no code implementations29 Mar 2017 Krishnaram Kenthapadi, Stuart Ambler, Liang Zhang, Deepak Agarwal

The recently launched LinkedIn Salary product has been designed with the goal of providing compensation insights to the world's professionals and thereby helping them optimize their earning potential.

Randomized Block Frank-Wolfe for Convergent Large-Scale Learning

no code implementations27 Dec 2016 Liang Zhang, Gang Wang, Daniel Romero, Georgios B. Giannakis

To circumvent the limitations of existing methods, the present work develops step sizes for RB-FW that enable a flexible selection of the number of blocks to update per iteration while ensuring convergence and feasibility of the iterates.

Sparse Phase Retrieval via Truncated Amplitude Flow

1 code implementation23 Nov 2016 Gang Wang, Liang Zhang, Georgios B. Giannakis, Mehmet Akcakaya, Jie Chen

Upon formulating sparse PR as an amplitude-based nonconvex optimization task, SPARTA works iteratively in two stages: In stage one, the support of the underlying sparse signal is recovered using an analytically well-justified rule, and subsequently, a sparse orthogonality-promoting initialization is obtained via power iterations restricted on the support; and, in the second stage, the initialization is successively refined by means of hard thresholding based gradient-type iterations.

Information Theory Information Theory Optimization and Control

Fast Computation of Posterior Mode in Multi-Level Hierarchical Models

no code implementations NeurIPS 2008 Liang Zhang, Deepak Agarwal

Multi-level hierarchical models provide an attractive framework for incorporating correlations induced in a response variable organized in a hierarchy.

Cannot find the paper you are looking for? You can Submit a new open access paper.