Search Results for author: Li Zhang

Found 309 papers, 136 papers with code

Multi-Label Transfer Learning for Multi-Relational Semantic Similarity

no code implementations SEMEVAL 2019 Li Zhang, Steven R. Wilson, Rada Mihalcea

Multi-relational semantic similarity datasets define the semantic relations between two short texts in multiple ways, e. g., similarity, relatedness, and so on.

Multi-Task Learning regression +3

Direct Network Transfer: Transfer Learning of Sentence Embeddings for Semantic Similarity

no code implementations20 Apr 2018 Li Zhang, Steven R. Wilson, Rada Mihalcea

Sentence encoders, which produce sentence embeddings using neural networks, are typically evaluated by how well they transfer to downstream tasks.

Natural Language Understanding Semantic Similarity +5

Cascaded multi-scale and multi-dimension convolutional neural network for stereo matching

no code implementations26 Mar 2018 Haihua Lu, Hai Xu, Li Zhang, Yong Zhao

Firstly, we propose a new multi-scale matching cost computation sub-network, in which two different sizes of receptive fields are implemented parallelly.

Stereo Matching Stereo Matching Hand

Algorithmic Collusion in Cournot Duopoly Market: Evidence from Experimental Economics

no code implementations21 Feb 2018 Nan Zhou, Li Zhang, Shijian Li, Zhijian Wang

In application, we hope, the frameworks, the algorithm design as well as the experiment environment illustrated in this work, can be an incubator or a test bed for researchers and policymakers to handle the emerging algorithmic collusion.

Actor-Critic Sequence Training for Image Captioning

no code implementations29 Jun 2017 Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, Timothy M. Hospedales

Generating natural language descriptions of images is an important capability for a robot or other visual-intelligence driven AI agent that may need to communicate with human users about what it is seeing.

Image Captioning reinforcement-learning +1

GaDei: On Scale-up Training As A Service For Deep Learning

no code implementations18 Nov 2016 Wei Zhang, Minwei Feng, Yunhui Zheng, Yufei Ren, Yandong Wang, Ji Liu, Peng Liu, Bing Xiang, Li Zhang, Bo-Wen Zhou, Fei Wang

By evaluating the NLC workloads, we show that only the conservative hyper-parameter setup (e. g., small mini-batch size and small learning rate) can guarantee acceptable model accuracy for a wide range of customers.

On the Protection of Private Information in Machine Learning Systems: Two Recent Approaches

no code implementations26 Aug 2017 Martín Abadi, Úlfar Erlingsson, Ian Goodfellow, H. Brendan McMahan, Ilya Mironov, Nicolas Papernot, Kunal Talwar, Li Zhang

The recent, remarkable growth of machine learning has led to intense interest in the privacy of the data on which machine learning relies, and to new techniques for preserving privacy.

BIG-bench Machine Learning valid

Classification of Neurological Gait Disorders Using Multi-task Feature Learning

no code implementations8 Dec 2016 Ioannis Papavasileiou, Wenlong Zhang, Xin Wang, Jinbo Bi, Li Zhang, Song Han

An advanced machine learning method, multi-task feature learning (MTFL), is used to jointly train classification models of a subject's gait in three classes, post-stroke, PD and healthy gait.

Classification General Classification

Real-Time Visual Tracking: Promoting the Robustness of Correlation Filter Learning

no code implementations29 Aug 2016 Yao Sui, Ziming Zhang, Guanghui Wang, Yafei Tang, Li Zhang

By exploiting the anisotropy of the filter response, three sparsity related loss functions are proposed to alleviate the overfitting issue of previous methods and improve the overall tracking performance.

Real-Time Visual Tracking

Tracking Completion

no code implementations29 Aug 2016 Yao Sui, Guanghui Wang, Yafei Tang, Li Zhang

A fundamental component of modern trackers is an online learned tracking model, which is typically modeled either globally or locally.

Matrix Completion

Differentially Private False Discovery Rate Control

no code implementations11 Jul 2018 Cynthia Dwork, Weijie J. Su, Li Zhang

Differential privacy provides a rigorous framework for privacy-preserving data analysis.

Privacy Preserving Two-sample testing

Nearly Optimal Private LASSO

no code implementations NeurIPS 2015 Kunal Talwar, Abhradeep Guha Thakurta, Li Zhang

In addition, we show that this error bound is nearly optimal amongst all differentially private algorithms.

Depth creates no more spurious local minima

no code implementations28 Jan 2019 Li Zhang

We show that for any convex differentiable loss, a deep linear network has no spurious local minima as long as it is true for the two layer case.

Exemplar-Based Face Parsing

no code implementations CVPR 2013 Brandon M. Smith, Li Zhang, Jonathan Brandt, Zhe Lin, Jianchao Yang

Given a test image, our algorithm first selects a subset of exemplar images from the database, Our algorithm then computes a nonrigid warp for each exemplar image to align it with the test image.

Face Alignment Face Parsing +3

Nonparametric Context Modeling of Local Appearance for Pose- and Expression-Robust Facial Landmark Localization

no code implementations CVPR 2014 Brandon M. Smith, Jonathan Brandt, Zhe Lin, Li Zhang

We propose a data-driven approach to facial landmark localization that models the correlations between each landmark and its surrounding appearance features.

Face Alignment

Discriminative Low-Rank Tracking

no code implementations ICCV 2015 Yao Sui, Yafei Tang, Li Zhang

Good tracking performance is in general attributed to accurate representation over previously obtained targets or reliable discrimination between the target and the surrounding background.

Field-aware Neural Factorization Machine for Click-Through Rate Prediction

no code implementations25 Feb 2019 Li Zhang, Weichen Shen, Shijian Li, Gang Pan

This model can have strong second order feature interactive learning ability like Field-aware Factorization Machine, on this basis, deep neural network is used for higher-order feature combination learning.

Click-Through Rate Prediction Feature Engineering +1

Monte Carlo Neural Fictitious Self-Play: Approach to Approximate Nash equilibrium of Imperfect-Information Games

no code implementations22 Mar 2019 Li Zhang, Wei Wang, Shijian Li, Gang Pan

Experimentally, we demonstrate that the proposed Monte Carlo Neural Fictitious Self Play can converge to approximate Nash equilibrium in games with large-scale search depth while the Neural Fictitious Self Play can't.

Deep Learning based Pedestrian Detection at Distance in Smart Cities

no code implementations18 Nov 2018 Ranjith K Dinakaran, Philip Easom, Ahmed Bouridane, Li Zhang, Richard Jiang, Fozia Mehboob, Abdul Rauf

Generative adversarial networks (GANs) have been promising for many computer vision problems due to their powerful capabilities to enhance the data for training and test.

Decoder Pedestrian Detection

Automated Segmentation of Pulmonary Lobes using Coordination-Guided Deep Neural Networks

2 code implementations19 Apr 2019 Wenjia Wang, Junxuan Chen, Jie Zhao, Ying Chi, Xuansong Xie, Li Zhang, Xian-Sheng Hua

The proposed model is trained and evaluated on a few publicly available datasets and has achieved the state-of-the-art accuracy with a mean Dice coefficient index of 0. 947 $\pm$ 0. 044.

Segmentation

Jointly Learning Structured Analysis Discriminative Dictionary and Analysis Multiclass Classifier

no code implementations27 May 2019 Zhao Zhang, Weiming Jiang, Jie Qin, Li Zhang, Fanzhang Li, Min Zhang, Shuicheng Yan

Then we compute a linear classifier based on the approximated sparse codes by an analysis mechanism to simultaneously consider the classification and representation powers.

Dictionary Learning General Classification

Distant Pedestrian Detection in the Wild using Single Shot Detector with Deep Convolutional Generative Adversarial Networks

no code implementations29 May 2019 Ranjith Dinakaran, Philip Easom, Li Zhang, Ahmed Bouridane, Richard Jiang, Eran Edirisinghe

In our work, GAN has been trained intensively on low resolution images, in order to neutralize the challenges of the pedestrian detection in the wild, and considered humans, and few other classes for detection in smart cities.

object-detection Object Detection +1

End-to-End Learning of Multi-scale Convolutional Neural Network for Stereo Matching

no code implementations25 Jun 2019 Li Zhang, Quanhong Wang, Haihua Lu, Yong Zhao

To tackle this problem, we propose a network for disparity estimation based on abundant contextual details and semantic information, called Multi-scale Features Network (MSFNet).

Disparity Estimation Stereo Matching +1

ASCNet: Adaptive-Scale Convolutional Neural Networks for Multi-Scale Feature Learning

no code implementations7 Jul 2019 Mo Zhang, Jie Zhao, Xiang Li, Li Zhang, Quanzheng Li

Such pixel-level dilation rates produce optimal receptive fields so that the information of objects with different sizes can be extracted at the corresponding scale.

Semantic Segmentation

Multi-level Domain Adaptive learning for Cross-Domain Detection

no code implementations26 Jul 2019 Rongchang Xie, Fei Yu, Jiachao Wang, Yizhou Wang, Li Zhang

In recent years, object detection has shown impressive results using supervised deep learning, but it remains challenging in a cross-domain environment.

Object object-detection +1

Annotation-Free Cardiac Vessel Segmentation via Knowledge Transfer from Retinal Images

no code implementations26 Jul 2019 Fei Yu, Jie Zhao, Yanjun Gong, Zhi Wang, Yuxi Li, Fan Yang, Bin Dong, Quanzheng Li, Li Zhang

Segmenting coronary arteries is challenging, as classic unsupervised methods fail to produce satisfactory results and modern supervised learning (deep learning) requires manual annotation which is often time-consuming and can some time be infeasible.

Generative Adversarial Network Transfer Learning

Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning

no code implementations20 Nov 2019 Yulin Sun, Zhao Zhang, Weiming Jiang, Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

In this paper, we propose a structured Robust Adaptive Dic-tionary Pair Learning (RA-DPL) framework for the discrim-inative sparse representation learning.

Representation Learning

Unsupervised Learnable Sinogram Inpainting Network (SIN) for Limited Angle CT reconstruction

no code implementations9 Nov 2018 Ji Zhao, Zhiqiang Chen, Li Zhang, Xin Jin

In this paper, we propose a sinogram inpainting network (SIN) to solve limited-angle CT reconstruction problem, which is a very challenging ill-posed issue and of great interest for several clinical applications.

Medical Physics Image and Video Processing

Deep Self-representative Concept Factorization Network for Representation Learning

no code implementations13 Dec 2019 Yan Zhang, Zhao Zhang, Zheng Zhang, Mingbo Zhao, Li Zhang, Zheng-Jun Zha, Meng Wang

In this paper, we investigate the unsupervised deep representation learning issue and technically propose a novel framework called Deep Self-representative Concept Factorization Network (DSCF-Net), for clustering deep features.

Clustering Representation Learning

Automatic marker-free registration of tree point-cloud data based on rotating projection

no code implementations30 Jan 2020 Xiuxian Xu, Pei Wang, Xiaozheng Gan, Ya-Xin Li, Li Zhang, Qing Zhang, Mei Zhou, Yinghui Zhao, Xinwei Li

In coarse registration, point clouds produced by each scan are projected onto a spherical surface to generate a series of two-dimensional (2D) images, which are used to estimate the initial positions of multiple scans.

Selective Convolutional Network: An Efficient Object Detector with Ignoring Background

no code implementations4 Feb 2020 Hefei Ling, Yangyang Qin, Li Zhang, Yuxuan Shi, Ping Li

It is well known that attention mechanisms can effectively improve the performance of many CNNs including object detectors.

Superbloom: Bloom filter meets Transformer

no code implementations11 Feb 2020 John Anderson, Qingqing Huang, Walid Krichene, Steffen Rendle, Li Zhang

We extend the idea of word pieces in natural language models to machine learning tasks on opaque ids.

Feedback Graph Convolutional Network for Skeleton-based Action Recognition

no code implementations17 Mar 2020 Hao Yang, Dan Yan, Li Zhang, Dong Li, YunDa Sun, ShaoDi You, Stephen J. Maybank

It transmits the high-level semantic features to the low-level layers and flows temporal information stage by stage to progressively model global spatial-temporal features for action recognition; (3) The FGCN model provides early predictions.

Action Recognition Skeleton Based Action Recognition

In-Vehicle Object Detection in the Wild for Driverless Vehicles

no code implementations27 Apr 2020 Ranjith Dinakaran, Li Zhang, Richard Jiang

In-vehicle human object identification plays an important role in vision-based automated vehicle driving systems while objects such as pedestrians and vehicles on roads or streets are the primary targets to protect from driverless vehicles.

object-detection Object Detection

A Survey on Deep Learning for Neuroimaging-based Brain Disorder Analysis

no code implementations10 May 2020 Li Zhang, Mingliang Wang, Mingxia Liu, Daoqiang Zhang

Deep learning has been recently used for the analysis of neuroimages, such as structural magnetic resonance imaging (MRI), functional MRI, and positron emission tomography (PET), and has achieved significant performance improvements over traditional machine learning in computer-aided diagnosis of brain disorders.

SentPWNet: A Unified Sentence Pair Weighting Network for Task-specific Sentence Embedding

no code implementations22 May 2020 Li Zhang, Han Wang, Lingxiao Li

Our model, SentPWNet, exploits the neighboring spatial distribution of each sentence as locality weight to indicate the informative level of sentence pair.

Metric Learning Sentence +3

Long-Term Cloth-Changing Person Re-identification

no code implementations26 May 2020 Xuelin Qian, Wenxuan Wang, Li Zhang, Fangrui Zhu, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, xiangyang xue

Specifically, we consider that under cloth-changes, soft-biometrics such as body shape would be more reliable.

Cloth-Changing Person Re-Identification

Self-supervised Video Object Segmentation

no code implementations22 Jun 2020 Fangrui Zhu, Li Zhang, Yanwei Fu, Guodong Guo, Weidi Xie

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a. k. a.

Object One-shot visual object segmentation +4

Egocentric Action Recognition by Video Attention and Temporal Context

no code implementations3 Jul 2020 Juan-Manuel Perez-Rua, Antoine Toisoul, Brais Martinez, Victor Escorcia, Li Zhang, Xiatian Zhu, Tao Xiang

In this challenge, action recognition is posed as the problem of simultaneously predicting a single `verb' and `noun' class label given an input trimmed video clip.

Action Recognition

A novel deep learning-based method for monochromatic image synthesis from spectral CT using photon-counting detectors

no code implementations20 Jul 2020 Ao Zheng, Hongkai Yang, Li Zhang, Yuxiang Xing

To solve this problem, in this paper, we proposed a novel deep learning-based monochromatic image synthesis method working in sinogram domain.

Image Generation

Learning-based Computer-aided Prescription Model for Parkinson's Disease: A Data-driven Perspective

no code implementations31 Jul 2020 Yinghuan Shi, Wanqi Yang, Kim-Han Thung, Hao Wang, Yang Gao, Yang Pan, Li Zhang, Dinggang Shen

Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug.

A Survey on Concept Factorization: From Shallow to Deep Representation Learning

no code implementations31 Jul 2020 Zhao Zhang, Yan Zhang, Mingliang Xu, Li Zhang, Yi Yang, Shuicheng Yan

In this paper, we therefore survey the recent advances on CF methodologies and the potential benchmarks by categorizing and summarizing the current methods.

Clustering Representation Learning

PriceAggregator: An Intelligent System for Hotel Price Fetching

no code implementations30 Jun 2020 Jiangwei Zhang, Li Zhang, Vigneshwaran Raveendran, Ziv Ben-Zuk, Leonard Lu

The major challenge is that each supplier only allows Agoda to fetch for the hotel price with a limited amount of Queries Per Second (QPS).

Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval

no code implementations7 Aug 2020 Tao Wu, Ellie Ka-In Chio, Heng-Tze Cheng, Yu Du, Steffen Rendle, Dima Kuzmin, Ritesh Agarwal, Li Zhang, John Anderson, Sarvjeet Singh, Tushar Chandra, Ed H. Chi, Wen Li, Ankit Kumar, Xiang Ma, Alex Soares, Nitin Jindal, Pei Cao

In light of these problems, we observed that most online content platforms have both a search and a recommender system that, while having heterogeneous input spaces, can be connected through their common output item space and a shared semantic representation.

Information Retrieval Recommendation Systems +2

Dual-constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior

no code implementations8 Sep 2020 Yan Zhang, Zhao Zhang, Yang Wang, Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

Nonnegative matrix factorization is usually powerful for learning the "shallow" parts-based representation, but it clearly fails to discover deep hierarchical information within both the basis and representation spaces.

Clustering Graph Learning +1

Holistic Grid Fusion Based Stop Line Estimation

no code implementations18 Sep 2020 Runsheng Xu, Faezeh Tafazzoli, Li Zhang, Timo Rehfeld, Gunther Krehl, Arunava Seal

Intersection scenarios provide the most complex traffic situations in Autonomous Driving and Driving Assistance Systems.

Autonomous Driving

Skin disease diagnosis with deep learning: a review

no code implementations11 Nov 2020 Hongfeng Li, Yini Pan, Jie Zhao, Li Zhang

As an important part of this article, we then review the literature involving deep learning methods for skin disease diagnosis from several aspects according to the specific tasks.

Direct Classification of Emotional Intensity

no code implementations15 Nov 2020 Jacob Ouyang, Isaac R Galatzer-Levy, Vidya Koesmahargyo, Li Zhang

In this paper, we present a model that can directly predict emotion intensity score from video inputs, instead of deriving from action units.

Classification General Classification

Searching for Quasi-Periodic Modulations in $γ$-ray Active Galactic Nuclei

no code implementations29 Jan 2020 Pengfei Zhang, Dahai Yan, Jianeng Zhou, Jiancheng Wang, Li Zhang

We perform a systematic search of quasi-periodic variabilities in $\gamma$-ray active galactic nuclei (AGNs) in the third \emph{Fermi} Large Area Telescope source catalog (3FGL).

High Energy Astrophysical Phenomena

A Systematic Literature Review on Federated Learning: From A Model Quality Perspective

no code implementations1 Dec 2020 Yi Liu, Li Zhang, Ning Ge, Guanghao Li

In this process, the server uses an incentive mechanism to encourage clients to contribute high-quality and large-volume data to improve the global model.

Federated Learning

Hop-Hop Relation-aware Graph Neural Networks

no code implementations21 Dec 2020 Li Zhang, Yan Ge, Haiping Lu

Graph Neural Networks (GNNs) are widely used in graph representation learning.

Knowledge Graph Embedding Relation

Unifying Homophily and Heterophily Network Transformation via Motifs

no code implementations21 Dec 2020 Yan Ge, Jun Ma, Li Zhang, Haiping Lu

Because H2NT can sparsify networks with motif structures, it can also improve the computational efficiency of existing network embedding methods when integrated.

Computational Efficiency Network Embedding +1

Failure Prediction in Production Line Based on Federated Learning: An Empirical Study

no code implementations25 Jan 2021 Ning Ge, Guanghao Li, Li Zhang, Yi Liu Yi Liu

Data protection across organizations is limiting the application of centralized learning (CL) techniques.

Federated Learning

EEGFuseNet: Hybrid Unsupervised Deep Feature Characterization and Fusion for High-Dimensional EEG with An Application to Emotion Recognition

no code implementations7 Feb 2021 Zhen Liang, Rushuang Zhou, Li Zhang, Linling Li, Gan Huang, Zhiguo Zhang, Shin Ishii

The performance of the extracted deep and low-dimensional features by EEGFuseNet is carefully evaluated in an unsupervised emotion recognition application based on three public emotion databases.

EEG Emotion Recognition +2

Hierarchical Road Topology Learning for Urban Map-less Driving

no code implementations31 Mar 2021 Li Zhang, Faezeh Tafazzoli, Gunther Krehl, Runsheng Xu, Timo Rehfeld, Manuel Schier, Arunava Seal

The majority of current approaches in autonomous driving rely on High-Definition (HD) maps which detail the road geometry and surrounding area.

Autonomous Driving

Hybrid Template Canonical Correlation Analysis Method for Enhancing SSVEP Recognition under data-limited Condition

no code implementations7 Aug 2020 Runfeng Miao, Li Zhang, Qiang Sun

In this study, an advanced CCA-based algorithn called hybrid template canonical correlation analysis (HTCCA) was proposed to improve the performance of brain-computer interface (BCI) based on steady state visual evoked potential (SSVEP) uuder data-linited condition.

EEG SSVEP +1

Optimize Neural Fictitious Self-Play in Regret Minimization Thinking

no code implementations22 Apr 2021 Yuxuan Chen, Li Zhang, Shijian Li, Gang Pan

Optimization of deep learning algorithms to approach Nash Equilibrium remains a significant problem in imperfect information games, e. g. StarCraft and poker.

Starcraft

Delving into Data: Effectively Substitute Training for Black-box Attack

no code implementations CVPR 2021 Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, xiangyang xue

Previous substitute training approaches focus on stealing the knowledge of the target model based on real training data or synthetic data, without exploring what kind of data can further improve the transferability between the substitute and target models.

Adversarial Attack

Prediction of clinical tremor severity using Rank Consistent Ordinal Regression

no code implementations3 May 2021 Li Zhang, Vijay Yadav, Vidya Koesmahargyo, Anzar Abbas, Isaac Galatzer-Levy

The videos are coupled with clinician assessed TETRAS scores, which are used as ground truth labels to train the DNN.

regression Transfer Learning

Composite Localization for Human Pose Estimation

no code implementations15 May 2021 ZiFan Chen, Xin Qin, Chao Yang, Li Zhang

This work proposes a novel deep learning framework for human pose estimation called composite localization to divide the complex learning objective into two simpler ones: a sparse heatmap to find the keypoint's approximate location and two short-distance offsetmaps to obtain its final precise coordinates.

Distance regression Pose Estimation

Oneshot Differentially Private Top-k Selection

no code implementations18 May 2021 Gang Qiao, Weijie J. Su, Li Zhang

Being able to efficiently and accurately select the top-$k$ elements with differential privacy is an integral component of various private data analysis tasks.

A Unified Efficient Pyramid Transformer for Semantic Segmentation

no code implementations29 Jul 2021 Fangrui Zhu, Yi Zhu, Li Zhang, Chongruo wu, Yanwei Fu, Mu Li

Semantic segmentation is a challenging problem due to difficulties in modeling context in complex scenes and class confusions along boundaries.

Segmentation Semantic Segmentation

Multi-Frequency Wireless Channel Measurements and Characteristics Analysis in Indoor Corridor Scenarios

no code implementations14 Aug 2021 ZiHao Zhou, Li Zhang, Xinyue Chen, Cheng-Xiang Wang, Jie Huang

In this paper, we conduct wireless channel measurements in indoor corridor scenarios at 2. 4, 5 and 6 GHz bands with bandwidth of 320 MHz.

Unsupervised Domain Adaptation in Semantic Segmentation Based on Pixel Alignment and Self-Training

no code implementations29 Sep 2021 Hexin Dong, Fei Yu, Jie Zhao, Bin Dong, Li Zhang

This paper proposes an unsupervised cross-modality domain adaptation approach based on pixel alignment and self-training.

Segmentation Semantic Segmentation +1

SmartCiteCon: Implicit Citation Context Extraction from Academic Literature Using Supervised Learning

no code implementations WOSP 2020 Chenrui Guo, Haoran Cui, Li Zhang, Jiamin Wang, Wei Lu, Jian Wu

The tool is built on a Support Vector Machine (SVM) model trained on a set of 7, 058 manually annotated citation context sentences, curated from 34, 000 papers from the ACL Anthology.

Learning from Mistakes -- A Framework for Neural Architecture Search

1 code implementation11 Nov 2021 Bhanu Garg, Li Zhang, Pradyumna Sridhara, Ramtin Hosseini, Eric Xing, Pengtao Xie

We propose a novel machine learning method called Learning From Mistakes (LFM), wherein the learner improves its ability to learn by focusing more on the mistakes during revision.

BIG-bench Machine Learning Neural Architecture Search

Depth creates no more spurious local minima in linear networks

no code implementations25 Sep 2019 Li Zhang

We show that for any convex differentiable loss, a deep linear network has no spurious local minima as long as it is true for the two layer case.

ALX: Large Scale Matrix Factorization on TPUs

no code implementations3 Dec 2021 Harsh Mehta, Steffen Rendle, Walid Krichene, Li Zhang

We present ALX, an open-source library for distributed matrix factorization using Alternating Least Squares, written in JAX.

Link Prediction

A general framework for adaptive two-index fusion attribute weighted naive Bayes

no code implementations24 Feb 2022 Xiaoliang Zhou, Dongyang Wu, Zitong You, Li Zhang, Ning Ye

In addition, the ATFNB framework can improve the existing two-index NB model by introducing the adaptive switching factor \{beta}.

Attribute

ImpDet: Exploring Implicit Fields for 3D Object Detection

no code implementations31 Mar 2022 Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, xiangyang xue

Conventional 3D object detection approaches concentrate on bounding boxes representation learning with several parameters, i. e., localization, dimension, and orientation.

3D Object Detection Object +2

In Defense of Subspace Tracker: Orthogonal Embedding for Visual Tracking

no code implementations17 Apr 2022 Yao Sui, Guanghui Wang, Li Zhang

The paper focuses on a classical tracking model, subspace learning, grounded on the fact that the targets in successive frames are considered to reside in a low-dimensional subspace or manifold due to the similarity in their appearances.

Visual Tracking

Reasoning about Procedures with Natural Language Processing: A Tutorial

no code implementations16 May 2022 Li Zhang

This tutorial provides a comprehensive and in-depth view of the research on procedures, primarily in Natural Language Processing.

Accelerating Score-based Generative Models for High-Resolution Image Synthesis

no code implementations8 Jun 2022 Hengyuan Ma, Li Zhang, Xiatian Zhu, Jingfeng Zhang, Jianfeng Feng

To ensure stability of convergence in sampling and generation quality, however, this sequential sampling process has to take a small step size and many sampling iterations (e. g., 2000).

Image Generation Vocal Bursts Intensity Prediction

Intra Encoding Complexity Control with a Time-Cost Model for Versatile Video Coding

no code implementations13 Jun 2022 Yan Huang, Jizheng Xu, Li Zhang, Yan Zhao, Li Song

Inspired by rate control algorithms, we propose a scheme to precisely control the intra encoding complexity of VVC.

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

no code implementations22 Jun 2022 Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Hiroaki Hayashi, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, João Sedoc, Juraj Juraska, Kaustubh Dhole, Khyathi Raghavi Chandu, Laura Perez-Beltrachini, Leonardo F. R. Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Sanjay Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondrej Dusek, Paul Pu Liang, Pawan Sasanka Ammanamanchi, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Ronald Cardenas, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Štajner, Sebastien Montella, Shailza, Shailza Jolly, Simon Mille, Tahmid Hasan, Tianhao Shen, Tosin Adewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou

This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, and human evaluation to make definitive claims.

Benchmarking Text Generation

Knowledge-aware Neural Collective Matrix Factorization for Cross-domain Recommendation

no code implementations27 Jun 2022 Li Zhang, Yan Ge, Jun Ma, Jianmo Ni, Haiping Lu

In this paper, we propose to incorporate the knowledge graph (KG) for CDR, which enables items in different domains to share knowledge.

General Knowledge

SiamMask: A Framework for Fast Online Object Tracking and Segmentation

no code implementations5 Jul 2022 Weiming Hu, Qiang Wang, Li Zhang, Luca Bertinetto, Philip H. S. Torr

In this paper we introduce SiamMask, a framework to perform both visual object tracking and video object segmentation, in real-time, with the same simple method.

Multiple Object Tracking Object +5

Is “My Favorite New Movie” My Favorite Movie? Probing the Understanding of Recursive Noun Phrases

no code implementations NAACL 2022 Qing Lyu, Zheng Hua, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch

We introduce the Recursive Noun Phrase Challenge (RNPC), a dataset of three textual inference tasks involving textual entailment and event plausibility comparison, precisely targeting the understanding of recursive NPs.

Common Sense Reasoning Natural Language Inference

RCLane: Relay Chain Prediction for Lane Detection

no code implementations19 Jul 2022 Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, xiangyang xue

This is because most of the existing lane detection methods either treat the lane detection as a dense prediction or a detection task, few of them consider the unique topologies (Y-shape, Fork-shape, nearly horizontal lane) of the lane markers, which leads to sub-optimal solution.

Lane Detection

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

no code implementations24 Aug 2022 Guangqi Xie, Xin Li, Shiqi Lin, Li Zhang, Kai Zhang, Yue Li, Zhibo Chen

In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC.

Hierarchical Reinforcement Learning reinforcement-learning +3

Scalable Nanophotonic-Electronic Spiking Neural Networks

no code implementations28 Aug 2022 Luis El Srouji, Yun-jhu Lee, Mehmet Berkay On, Li Zhang, S. J. Ben Yoo

Photonic devices are ideal for the design of high-bandwidth, parallel architectures matching the SNN computational paradigm.

Data-Driven Deep Supervision for Skin Lesion Classification

no code implementations4 Sep 2022 Suraj Mishra, Yizhe Zhang, Li Zhang, Tianyu Zhang, X. Sharon Hu, Danny Z. Chen

Specifically, we analyze the convolutional network's behavior (field-of-view) to find the location of deep supervision for improved feature extraction.

Classification Lesion Classification +2

NWPU-ASLP System for the VoicePrivacy 2022 Challenge

no code implementations24 Sep 2022 Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie

Our system consists of four modules, including feature extractor, acoustic model, anonymization module, and neural vocoder.

Speaker Verification

Generative Model Watermarking Based on Human Visual System

no code implementations30 Sep 2022 Li Zhang, Yong liu, Shaoteng Liu, Tianshu Yang, Yexin Wang, Xinpeng Zhang, Hanzhou Wu

Intellectual property protection of deep neural networks is receiving attention from more and more researchers, and the latest research applies model watermarking to generative models for image processing.

TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge

no code implementations26 Oct 2022 Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang, Qing Wang, Lei Xie

In this challenge, we explore three kinds of typical speaker diarization systems, which are spectral clustering(SC) based diarization, target-speaker voice activity detection(TS-VAD) and end-to-end neural diarization(EEND) respectively.

Action Detection Activity Detection +2

Robust Time Series Chain Discovery with Incremental Nearest Neighbors

no code implementations3 Nov 2022 Li Zhang, Yan Zhu, Yifeng Gao, Jessica Lin

Inspired by a recent work that tracks how the nearest neighbor of a time series subsequence changes over time, we introduce a new TSC definition which is much more robust to noise in the data, in the sense that they can better locate the evolving patterns while excluding the non-evolving ones.

Time Series Time Series Analysis

Panoramic Video Salient Object Detection with Ambisonic Audio Guidance

no code implementations26 Nov 2022 Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj

In this paper, we aim to tackle the video salient object detection problem for panoramic videos, with their corresponding ambisonic audios.

Object object-detection +2

MSV Challenge 2022: NPU-HC Speaker Verification System for Low-resource Indian Languages

no code implementations30 Nov 2022 Yue Li, Li Zhang, Namin Wang, Jie Liu, Lei Xie

Specifically, the weight transfer fine-tuning aims to constrain the distance of the weights between the pre-trained model and the fine-tuned model, which takes advantage of the previously acquired discriminative ability from the large-scale out-domain datasets and avoids catastrophic forgetting and overfitting at the same time.

Speaker Verification

PMP: Privacy-Aware Matrix Profile against Sensitive Pattern Inference for Time Series

1 code implementation4 Jan 2023 Li Zhang, Jiahao Ding, Yifeng Gao, Jessica Lin

During the process, data sharing is often involved to allow the third-party modelers to perform specific time series data mining (TSDM) tasks based on the need of data owner.

Privacy Preserving Time Series +1

LB-SimTSC: An Efficient Similarity-Aware Graph Neural Network for Semi-Supervised Time Series Classification

no code implementations12 Jan 2023 Wenjie Xi, Arnav Jain, Li Zhang, Jessica Lin

Recently, Similarity-aware Time Series Classification (SimTSC) is proposed to address this problem by using a graph neural network classification model on the graph generated from pairwise Dynamic Time Warping (DTW) distance of batch data.

Classification Dynamic Time Warping +3

Syntax and Domain Aware Model for Unsupervised Program Translation

no code implementations8 Feb 2023 Fang Liu, Jia Li, Li Zhang

The experimental results on function translation tasks between Python, Java, and C++ show that SDA-Trans outperforms many large-scale pre-trained models, especially for unseen language translation.

Cross-Lingual Transfer Translation

Multi-Task Differential Privacy Under Distribution Skew

no code implementations15 Feb 2023 Walid Krichene, Prateek Jain, Shuang Song, Mukund Sundararajan, Abhradeep Thakurta, Li Zhang

We study the problem of multi-task learning under user-level differential privacy, in which $n$ users contribute data to $m$ tasks, each involving a subset of users.

Multi-Task Learning

S-NeRF: Neural Radiance Fields for Street Views

no code implementations1 Mar 2023 Ziyang Xie, Junge Zhang, Wenye Li, Feihu Zhang, Li Zhang

Specifically, we improve the scene parameterization function and the camera poses for learning better neural representations from street views.

Novel View Synthesis Self-Driving Cars

Single-view Neural Radiance Fields with Depth Teacher

no code implementations17 Mar 2023 Yurui Chen, Chun Gu, Feihu Zhang, Li Zhang

Moreover, it has poor generalizations to new scenes and requires retraining or fine-tuning on each scene.

Novel View Synthesis

Learning by Grouping: A Multilevel Optimization Framework for Improving Fairness in Classification without Losing Accuracy

no code implementations2 Apr 2023 Ramtin Hosseini, Li Zhang, Bhanu Garg, Pengtao Xie

Our proposed framework involves three stages of learning, which are formulated as a three-level optimization problem: (i) learning to group problems into different subgroups; (ii) learning group-specific sub-models for problem-solving; and (iii) updating group assignments of training examples by minimizing the validation loss.

Decision Making Domain Adaptation +2

OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

no code implementations26 Apr 2023 Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

Model A aims to enhance the feature extraction ability of 360{\deg} image positional information, while Model B further focuses on the high-frequency information of 360{\deg} images.

Image Super-Resolution Position

Train-Once-for-All Personalization

no code implementations CVPR 2023 Hong-You Chen, Yandong Li, Yin Cui, Mingda Zhang, Wei-Lun Chao, Li Zhang

We study the problem of how to train a "personalization-friendly" model such that given only the task descriptions, the model can be adapted to different end-users' needs, e. g., for accurately classifying different subsets of objects.

propnet: Propagating 2D Annotation to 3D Segmentation for Gastric Tumors on CT Scans

no code implementations29 May 2023 ZiFan Chen, Jiazheng Li, Jie Zhao, Yiting Liu, Hongfeng Li, Bin Dong, Lei Tang, Li Zhang

This model consists of a proposing stage for coarse segmentation and a refining stage for improved segmentation, using two-way branches for enhanced performance and an up-down strategy for efficiency.

Segmentation Tumor Segmentation

Answering Compositional Queries with Set-Theoretic Embeddings

no code implementations7 Jun 2023 Shib Dasgupta, Andrew McCallum, Steffen Rendle, Li Zhang

The need to compactly and robustly represent item-attribute relations arises in many important tasks, such as faceted browsing and recommendation systems.

Attribute Recommendation Systems +1

Video Compression with Arbitrary Rescaling Network

no code implementations7 Jun 2023 Mengxi Guo, Shijie Zhao, Hao Jiang, Junlin Li, Li Zhang

Most video platforms provide video streaming services with different qualities, and the quality of the services is usually adjusted by the resolution of the videos.

Video Compression

CD-CTFM: A Lightweight CNN-Transformer Network for Remote Sensing Cloud Detection Fusing Multiscale Features

no code implementations12 Jun 2023 Wenxuan Ge, Xubing Yang, Li Zhang

In the decoder part, we utilize a lightweight network combing CNN and Transformer as backbone, which is conducive to extract local and global features simultaneously.

Cloud Detection Decoder

MCPI: Integrating Multimodal Data for Enhanced Prediction of Compound Protein Interactions

no code implementations15 Jun 2023 Li Zhang, Wenhao Li, Haotian Guan, Zhiquan He, Mingjun Cheng, Han Wang

The identification of compound-protein interactions (CPI) plays a critical role in drug screening, drug repurposing, and combination therapy studies.

Joint Coordinate Regression and Association For Multi-Person Pose Estimation, A Pure Neural Network Approach

no code implementations3 Jul 2023 Dongyang Yu, Yunshi Xie, Wangpeng An, Li Zhang, YuFeng Yao

We introduce a novel one-stage end-to-end multi-person 2D pose estimation algorithm, known as Joint Coordinate Regression and Association (JCRA), that produces human pose joints and associations without requiring any post-processing.

2D Pose Estimation Decoder +1

SUIT: Learning Significance-guided Information for 3D Temporal Detection

no code implementations4 Jul 2023 Zheyuan Zhou, Jiachen Lu, Yihan Zeng, Hang Xu, Li Zhang

To this end, we propose to learn Significance-gUided Information for 3D Temporal detection (SUIT), which simplifies temporal information as sparse features for information fusion across frames.

3D Object Detection Autonomous Driving +2

Towards Efficient In-memory Computing Hardware for Quantized Neural Networks: State-of-the-art, Open Challenges and Perspectives

no code implementations8 Jul 2023 Olga Krestinskaya, Li Zhang, Khaled Nabil Salama

Limited energy and computational resources on edge push the transition from traditional von Neumann architectures to In-memory Computing (IMC), especially for machine learning and neural network applications.

Quantization

Cluster-Induced Mask Transformers for Effective Opportunistic Gastric Cancer Screening on Non-contrast CT Scans

no code implementations10 Jul 2023 Mingze Yuan, Yingda Xia, Xin Chen, Jiawen Yao, Junli Wang, Mingyan Qiu, Hexin Dong, Jingren Zhou, Bin Dong, Le Lu, Li Zhang, Zaiyi Liu, Ling Zhang

In our experiments, the proposed method achieves a sensitivity of 85. 0% and specificity of 92. 6% for detecting gastric tumors on a hold-out test set consisting of 100 patients with cancer and 148 normal.

Specificity

Deep neural network improves the estimation of polygenic risk scores for breast cancer

no code implementations24 Jul 2023 Adrien Badré, Li Zhang, Wellington Muchero, Justin C. Reynolds, Chongle Pan

In the test cohort with 50% prevalence, the Area Under the receiver operating characteristic Curve (AUC) were 67. 4% for DNN, 64. 2% for BLUP, 64. 5% for BayesA, and 62. 4% for LDpred.

Improved Prognostic Prediction of Pancreatic Cancer Using Multi-Phase CT by Integrating Neural Distance and Texture-Aware Transformer

no code implementations1 Aug 2023 Hexin Dong, Jiawen Yao, Yuxing Tang, Mingze Yuan, Yingda Xia, Jian Zhou, Hong Lu, Jingren Zhou, Bin Dong, Le Lu, Li Zhang, Zaiyi Liu, Yu Shi, Ling Zhang

Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal cancer in which the tumor-vascular involvement greatly affects the resectability and, thus, overall survival of patients.

Semi-Supervised Dual-Stream Self-Attentive Adversarial Graph Contrastive Learning for Cross-Subject EEG-based Emotion Recognition

no code implementations13 Aug 2023 Weishan Ye, Zhiguo Zhang, Min Zhang, Fei Teng, Li Zhang, Linling Li, Gan Huang, Jianhong Wang, Dong Ni, Zhen Liang

In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the challenge of limited labeled data in cross-subject EEG-based emotion recognition.

Contrastive Learning Domain Adaptation +2

Designs and Implementations in Neural Network-based Video Coding

no code implementations11 Sep 2023 Yue Li, Junru Li, Chaoyi Lin, Kai Zhang, Li Zhang, Franck Galpin, Thierry Dumas, Hongtao Wang, Muhammed Coban, Jacob Ström, Du Liu, Kenneth Andersson

The past decade has witnessed the huge success of deep learning in well-known artificial intelligence applications such as face recognition, autonomous driving, and large language model like ChatGPT.

Autonomous Driving Face Recognition +3

Choice-75: A Dataset on Decision Branching in Script Learning

no code implementations21 Sep 2023 Zhaoyi Joey Hou, Li Zhang, Chris Callison-Burch

Script learning studies how stereotypical events unfold, enabling machines to reason about narratives with implicit information.

Descriptive

Private Matrix Factorization with Public Item Features

no code implementations17 Sep 2023 Mihaela Curmei, Walid Krichene, Li Zhang, Mukund Sundararajan

It can be applied to different types of public item data, including: (1) categorical item features; (2) item-item similarities learned from public sources; and (3) publicly available user feedback.

Collaborative Filtering

Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach

no code implementations ICCV 2023 Jiachen Lu, Renyuan Peng, Xinyue Cai, Hang Xu, Hongyang Li, Feng Wen, Wei zhang, Li Zhang

The extraction of road network is essential for the generation of high-definition maps since it enables the precise localization of road landmarks and their interconnections.

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization

no code implementations16 Oct 2023 Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter Jansen, Oyvind Tafjord, Niket Tandon, Li Zhang, Chris Callison-Burch, Peter Clark

Language agents have shown some ability to interact with an external environment, e. g., a virtual world such as ScienceWorld, to perform complex tasks, e. g., growing a plant, without the startup costs of reinforcement learning.

CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses

no code implementations15 Oct 2023 Hongyu Fu, Xin Yu, Lincheng Li, Li Zhang

Existing volumetric neural rendering techniques, such as Neural Radiance Fields (NeRF), face limitations in synthesizing high-quality novel views when the camera poses of input images are imperfect.

3D Reconstruction Neural Rendering +1

Private Learning with Public Features

no code implementations24 Oct 2023 Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

We study a class of private learning problems in which the data is a join of private and public features.

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

no code implementations6 Nov 2023 Yanqin Jiang, Li Zhang, Jin Gao, Weimin Hu, Yao Yao

This is achieved by leveraging the object-level 3D-aware image diffusion model as the primary supervision signal for training Dynamic Neural Radiance Fields (DyNeRF).

3D Generation Camera Calibration +3

One Size Does Not Fit All: Customizing Open-Domain Procedures

no code implementations16 Nov 2023 Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon

Our approach is to test several simple multi-LLM-agent architectures for customization, as well as an end-to-end LLM, using a new evaluation set, called CustomPlans, of over 200 WikiHow procedures each with a customization need.

Relightable 3D Gaussian: Real-time Point Cloud Relighting with BRDF Decomposition and Ray Tracing

no code implementations27 Nov 2023 Jian Gao, Chun Gu, Youtian Lin, Hao Zhu, Xun Cao, Li Zhang, Yao Yao

We present a novel differentiable point-based rendering framework for material and lighting decomposition from multi-view images, enabling editing, ray-tracing, and real-time relighting of the 3D point cloud.

BRDF estimation Lighting Estimation

Demonstration of Programmable Brain-Inspired Optoelectronic Neuron in Photonic Spiking Neural Network with Neural Heterogeneity

no code implementations27 Nov 2023 Yun-jhu Lee, Mehmet Berkay On, Luis El Srouji, Li Zhang, Mahmoud Abdelghany, S. J. Ben Yoo

Photonic Spiking Neural Networks (PSNN) composed of the co-integrated CMOS and photonic elements can offer low loss, low power, highly-parallel, and high-throughput computing for brain-inspired neuromorphic systems.

Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation

no code implementations9 Jan 2024 Jinhai Yang, Mengxi Guo, Shijie Zhao, Junlin Li, Li Zhang

In this paper, we propose to directly predict the optimal transcoding resolution at each preset bitrate for efficient bitrate ladder construction.

Fast Dynamic 3D Object Generation from a Single-view Video

no code implementations16 Jan 2024 Zijie Pan, Zeyu Yang, Xiatian Zhu, Li Zhang

Generating dynamic 3D object from a single-view video is challenging due to the lack of 4D labeled data.

Image Generation Image to 3D +3

LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression

no code implementations1 Feb 2024 Wei Jiang, Junru Li, Kai Zhang, Li Zhang

To validate the effectiveness of our proposed LGMC, we integrate it with DCVC-TCM and obtain learned video compression with joint local and global motion compensation (LVC-LGMC).

Motion Compensation Video Compression

S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation

no code implementations3 Feb 2024 Yurui Chen, Junge Zhang, Ziyang Xie, Wenye Li, Feihu Zhang, Jiachen Lu, Li Zhang

Autonomous driving simulation system plays a crucial role in enhancing self-driving data and simulating complex and rare traffic scenarios, ensuring navigation safety.

Autonomous Driving

TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling

no code implementations4 Feb 2024 Jiaxiang Dong, Haixu Wu, Yuxuan Wang, Yunzhong Qiu, Li Zhang, Jianmin Wang, Mingsheng Long

To emphasize temporal correlation modeling, this paper proposes TimeSiam as a simple but effective self-supervised pre-training framework for Time series based on Siamese networks.

Contrastive Learning Data Augmentation +1

RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation

no code implementations7 Feb 2024 Xiaohan Yu, Li Zhang, Xin Zhao, Yue Wang, Zhongrui Ma

To address this limitation, we propose a new paradigm, ID representation, which incorporates pre-trained ID embeddings into LLMs in a complementary manner.

Recommendation Systems

A Neural-network Enhanced Video Coding Framework beyond ECM

no code implementations13 Feb 2024 Yanchen Zhao, Wenxuan He, Chuanmin Jia, Qizhe Wang, Junru Li, Yue Li, Chaoyi Lin, Kai Zhang, Li Zhang, Siwei Ma

In this paper, a hybrid video compression framework is proposed that serves as a demonstrative showcase of deep learning-based approaches extending beyond the confines of traditional coding methodologies.

Video Compression

Calibrating Large Language Models with Sample Consistency

no code implementations21 Feb 2024 Qing Lyu, Kumar Shridhar, Chaitanya Malaviya, Li Zhang, Yanai Elazar, Niket Tandon, Marianna Apidianaki, Mrinmaya Sachan, Chris Callison-Burch

Accurately gauging the confidence level of Large Language Models' (LLMs) predictions is pivotal for their reliable application.

FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis

no code implementations22 Feb 2024 Yan Xing, Pan Wang, Ligang Liu, Daolun Li, Li Zhang

We present a novel framework, called FrameNeRF, designed to apply off-the-shelf fast high-fidelity NeRF models with fast training speed and high rendering quality for few-shot novel view synthesis tasks.

Novel View Synthesis

A First Look at GPT Apps: Landscape and Vulnerability

no code implementations23 Feb 2024 Zejun Zhang, Li Zhang, Xin Yuan, Anlan Zhang, Mengwei Xu, Feng Qian

With the advancement of Large Language Models (LLMs), increasingly sophisticated and powerful GPTs are entering the market.

BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM

no code implementations26 Feb 2024 Li Zhang, Youwei Liang, Ruiyi Zhang, Amirhosein Javadi, Pengtao Xie

Secondly, SAM faces challenges in excelling at specific downstream tasks, like medical imaging, due to a disparity between the distribution of its pretraining data, which predominantly consists of general-domain images, and the data used in downstream tasks.

Image Segmentation Segmentation +1

RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation

no code implementations29 Feb 2024 Jie Zhang, Xubing Yang, Rui Jiang, Wei Shao, Li Zhang

While the direct application of SAM to remote sensing image segmentation tasks does not yield satisfactory results, we propose RSAM-Seg, which stands for Remote Sensing SAM with Semantic Segmentation, as a tailored modification of SAM for the remote sensing field and eliminates the need for manual intervention to provide prompts.

Cloud Detection Image Segmentation +2

Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising

no code implementations10 Mar 2024 Yuang Wang, Siyeop Yoon, Pengfei Jin, Matthew Tivnan, Zhennong Chen, Rui Hu, Li Zhang, Zhiqiang Chen, Quanzheng Li, Dufan Wu

As a promising alternative, the Image-to-Image Schr\"odinger Bridge (I2SB) initializes the generative process from corrupted images and integrates training techniques from conditional diffusion models.

Denoising Image Restoration +1

FrameQuant: Flexible Low-Bit Quantization for Transformers

no code implementations10 Mar 2024 Harshavardhan Adepu, Zhanpeng Zeng, Li Zhang, Vikas Singh

If quantization is interpreted as the addition of noise, our casting of the problem allows invoking an extensive body of known consistent recovery and noise robustness guarantees.

Quantization

OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation

no code implementations18 Mar 2024 Haochen Jiang, Yueming Xu, Yihan Zeng, Hang Xu, Wei zhang, Jianfeng Feng, Li Zhang

We model the geometric structure of the scene with occupancy representation and distill the pre-trained open vocabulary model into a 3D language field via volume rendering for zero-shot inference.

3D Reconstruction 3D Scene Reconstruction +3

STEntConv: Predicting Disagreement with Stance Detection and a Signed Graph Convolutional Network

1 code implementation23 Mar 2024 Isabelle Lorge, Li Zhang, Xiaowen Dong, Janet B. Pierrehumbert

The rise of social media platforms has led to an increase in polarised online discussions, especially on political and socio-cultural topics such as elections and climate change.

Stance Detection

Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments

no code implementations2 Apr 2024 Qianhui Zhao, Fang Liu, Li Zhang, Yang Liu, Zhen Yan, Zhenghao Chen, Yufei Zhou, Jing Jiang, Ge Li

Automated generation of feedback on programming assignments holds significant benefits for programming education, especially when it comes to advanced assignments.

Language Modelling Large Language Model +1

Exploring and Evaluating Hallucinations in LLM-Powered Code Generation

no code implementations1 Apr 2024 Fang Liu, Yang Liu, Lin Shi, Houkun Huang, Ruifeng Wang, Zhen Yang, Li Zhang

The rise of Large Language Models (LLMs) has significantly advanced many applications on software engineering tasks, particularly in code generation.

Code Generation Hallucination +2

LaneCorrect: Self-supervised Lane Detection

no code implementations23 Apr 2024 Ming Nie, Xinyue Cai, Hang Xu, Li Zhang

Lane detection has evolved highly functional autonomous driving system to understand driving scenes even under complex environments.

Autonomous Driving Lane Detection

ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images

no code implementations25 Apr 2024 Weiqi Li, Shijie Zhao, Bin Chen, Xinhua Cheng, Junlin Li, Li Zhang, Jian Zhang

With the advent of virtual reality technology, omnidirectional image (ODI) rescaling techniques are increasingly embraced for reducing transmitted and stored file sizes while preserving high image quality.

ERP

TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control

1 code implementation1 Jan 2021 Hongyu Zang, Xin Li, Li Zhang, Peiyao Zhao, Mingzhong Wang

Trust region methods and maximum entropy methods are two state-of-the-art branches used in reinforcement learning (RL) for the benefits of stability and exploration in continuous environments, respectively.

Continuous Control Reinforcement Learning (RL)

Persistent Animal Identification Leveraging Non-Visual Markers

2 code implementations13 Dec 2021 Michael P. J. Camilleri, Li Zhang, Rasneer S. Bains, Andrew Zisserman, Christopher K. I. Williams

Our objective is to locate and provide a unique identifier for each mouse in a cluttered home-cage environment through time, as a precursor to automated behaviour recognition for biological research.

Visual Tracking

Exploring the Curious Case of Code Prompts

1 code implementation26 Apr 2023 Li Zhang, Liam Dugan, Hainiu Xu, Chris Callison-Burch

Furthermore, we show that the style of code prompt has a large effect on performance for some but not all tasks and that fine-tuning on text instructions leads to better relative performance of code prompts.

In-Orbit Instrument Performance Study and Calibration for POLAR Polarization Measurements

1 code implementation19 May 2018 Zheng-Heng Li, Merlin Kole, Jian-Chao Sun, Li-Ming Song, Nicolas Produit, Bo-Bing Wu, Tianwei Bao, Tancredi Bernasconi, Franck Cadoux, Yongwei Dong, Minzi Feng, Neal Gauvin, Wojtek Hajdas, Hancheng Li, Lu Li, Xin Liu, Radoslaw Marcinkowski, Martin Pohl, Dominik K. Rybka, Haoli Shi, Jacek Szabelski, Teresa Tymieniecka, Ruijie Wang, Yuanhao Wang, Xing Wen, Xin Wu, Shao-Lin Xiong, Anna Zwolinska, Li Zhang, Lai-Yu Zhang, Shuang-Nan Zhang, Yong-Jie Zhang, Yi Zhao

POLAR is a compact space-borne detector designed to perform reliable measurements of the polarization for transient sources like Gamma-Ray Bursts in the energy range 50-500keV.

Instrumentation and Methods for Astrophysics High Energy Physics - Experiment Instrumentation and Detectors

Semantic Discord: Finding Unusual Local Patterns for Time Series

1 code implementation30 Jan 2020 Li Zhang, Yifeng Gao, Jessica Lin

Finding anomalous subsequence in a long time series is a very important but difficult problem.

Time Series Time Series Analysis

Analyze Gauss: Optimal Bounds for Privacy-Preserving Principal Component Analysis

1 code implementation1 May 2014 Cynthia Dwork, Kunal Talwar, Abhradeep Thakurta, Li Zhang

We show that the well-known, but misnamed, randomized response algorithm, with properly tuned parameters, provides a nearly optimal additive quality gap compared to the best possible singular subspace of A.

Attribute Privacy Preserving

Modular Blind Video Quality Assessment

1 code implementation29 Feb 2024 Wen Wen, Mu Li, Yabin Zhang, Yiting Liao, Junlin Li, Li Zhang, Kede Ma

Blind video quality assessment (BVQA) plays a pivotal role in evaluating and improving the viewing experience of end-users across a wide range of video-based platforms and services.

Video Quality Assessment

Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts

1 code implementation19 Mar 2024 Sai Ashish Somayajula, Youwei Liang, Abhishek Singh, Li Zhang, Pengtao Xie

Pretrained Language Models (PLMs) have advanced Natural Language Processing (NLP) tasks significantly, but finetuning PLMs on low-resource datasets poses significant challenges such as instability and overfitting.

Spatial Language Representation with Multi-Level Geocoding

1 code implementation21 Aug 2020 Sayali Kulkarni, Shailee Jain, Mohammad Javad Hosseini, Jason Baldridge, Eugene Ie, Li Zhang

We present a multi-level geocoding model (MLG) that learns to associate texts to geographic locations.

Toponym Resolution

Automatically detecting the conflicts between software requirements based on finer semantic analysis

1 code implementation3 Mar 2021 Weize Guo, Li Zhang, Xiaoli Lian

Besides, our approach is capable of transforming the natural language functional requirements into eight semantic tuples, which is useful not only the detection of the conflicts between requirements but also some other tasks such as constructing the association between requirements and so on.

Label Definitions Improve Semantic Role Labeling

1 code implementation NAACL 2022 Li Zhang, Ishan Jindal, Yunyao Li

Given a sentence and the predicate, a semantic role label is assigned to each argument of the predicate.

Semantic Role Labeling Sentence

Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed

1 code implementation14 Oct 2020 Dong Li, Sitong Chen, Xudong Liu, YunDa Sun, Li Zhang

In this paper, we propose a balanced filter pruning method for both performance and pruning speed.

Probabilistic computation and uncertainty quantification with emerging covariance

1 code implementation30 May 2023 Hengyuan Ma, Yang Qi, Li Zhang, Wenlian Lu, Jianfeng Feng

Building robust, interpretable, and secure AI system requires quantifying and representing uncertainty under a probabilistic perspective to mimic human cognitive abilities.

Uncertainty Quantification

Scale-aware Test-time Click Adaptation for Pulmonary Nodule and Mass Segmentation

1 code implementation28 Jul 2023 Zhihao LI, Jiancheng Yang, Yongchao Xu, Li Zhang, Wenhui Dong, Bo Du

Extensive experiments on both open-source and in-house datasets consistently demonstrate the effectiveness of the proposed method over some CNN and Transformer-based segmentation methods.

Image Segmentation Management +4

Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases

1 code implementation15 Dec 2021 Qing Lyu, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch

We introduce the Recursive Noun Phrase Challenge (RNPC), a dataset of three textual inference tasks involving textual entailment and event plausibility comparison, precisely targeting the understanding of recursive NPs.

Common Sense Reasoning Natural Language Inference

Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection

1 code implementation15 Nov 2023 Chun Bao, Jie Cao, Yaqian Ning, Tianhua Zhao, Zhijun Li, Zechen Wang, Li Zhang, Qun Hao

To address this issue, we propose a novel method for detecting infrared small targets called improved dense nested attention network (IDNANet), which is based on the transformer architecture.

Few-shot Action Recognition with Permutation-invariant Attention

1 code implementation ECCV 2020 Hongguang Zhang, Li Zhang, Xiaojuan Qi, Hongdong Li, Philip H. S. Torr, Piotr Koniusz

Such encoded blocks are aggregated by permutation-invariant pooling to make our approach robust to varying action lengths and long-range temporal dependencies whose patterns are unlikely to repeat even in clips of the same class.

Few-Shot action recognition Few Shot Action Recognition +3

Automated Segmentation of Cervical Nuclei in Pap Smear Images using Deformable Multi-path Ensemble Model

1 code implementation3 Dec 2018 Jie Zhao, Quanzheng Li, Xiang Li, Hongfeng Li, Li Zhang

Pap smear testing has been widely used for detecting cervical cancers based on the morphology properties of cell nuclei in microscopic image.

Image Segmentation Medical Image Segmentation +2

Goal-Oriented Script Construction

1 code implementation INLG (ACL) 2021 Qing Lyu, Li Zhang, Chris Callison-Burch

The knowledge of scripts, common chains of events in stereotypical scenarios, is a valuable asset for task-oriented natural language understanding systems.

Language Modelling Natural Language Understanding +1

Cannot find the paper you are looking for? You can Submit a new open access paper.