4 code implementations • HLT 2015 • Rie Johnson, Tong Zhang
A convolutional neural network (CNN) is a neural network that can exploit the internal structure of data, such as the 2D structure of image data.
Ranked #30 on Sentiment Analysis on IMDb
1 code implementation • NeurIPS 2018 • Qing Wang, Jiechao Xiong, Lei Han, Peng Sun, Han Liu, Tong Zhang
We consider deep policy learning with only batched historical trajectories.
10 code implementations • 28 Oct 2020 • Kai Han, Yunhe Wang, Qiulin Zhang, Wei zhang, Chunjing Xu, Tong Zhang
To this end, we summarize a tiny formula for downsizing neural architectures through a series of smaller models derived from the EfficientNet-B0 with the FLOPs constraint.
Ranked #695 on Image Classification on ImageNet
2 code implementations • NeurIPS 2018 • Wenbing Huang, Tong Zhang, Yu Rong, Junzhou Huang
Graph Convolutional Networks (GCNs) have become a crucial tool for learning representations of graph vertices.
Ranked #2 on Node Classification on Pubmed Full-supervised
1 code implementation • 13 Apr 2023 • Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang
Utilizing a reward model and a sufficient number of samples, our approach selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhances the model by fine-tuning on these filtered samples.
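The filtering step described above (score candidates with a reward model, keep only the best, then fine-tune on them) can be sketched in a few lines. The `reward_fn` below is a toy stand-in, not the paper's learned reward model:

```python
def filter_by_reward(samples, reward_fn, keep_ratio=0.5):
    """Keep only the highest-reward fraction of candidate samples."""
    scored = sorted(samples, key=reward_fn, reverse=True)
    k = max(1, int(len(scored) * keep_ratio))
    return scored[:k]

# Toy stand-in reward: prefer longer responses (illustration only);
# a real pipeline would call a learned reward model here.
samples = ["ok", "a helpful detailed answer", "bad", "a reasonably good answer"]
kept = filter_by_reward(samples, reward_fn=len, keep_ratio=0.5)
# A real pipeline would then fine-tune the language model on `kept`.
```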
1 code implementation • 21 Jun 2023 • Shizhe Diao, Rui Pan, Hanze Dong, Ka Shun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang
As the number of available models and specialized tasks keeps growing, the job of general finetuning becomes highly nontrivial.
1 code implementation • 26 Mar 2024 • Rui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang
To address this deficiency, we investigate layerwise properties of LoRA on fine-tuning tasks and observe an uncommon skewness of weight norms across different layers.
13 code implementations • CVPR 2021 • Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen
Convolution has been the core ingredient of modern neural networks, triggering the surge of deep learning in vision.
Ranked #706 on Image Classification on ImageNet
3 code implementations • NeurIPS 2020 • Kai Han, Yunhe Wang, Qiulin Zhang, Wei zhang, Chunjing Xu, Tong Zhang
To this end, we summarize a tiny formula for downsizing neural architectures through a series of smaller models derived from the EfficientNet-B0 with the FLOPs constraint.
1 code implementation • 7 Jan 2019 • Baoyuan Wu, Weidong Chen, Yanbo Fan, Yong Zhang, Jinlong Hou, Jie Liu, Tong Zhang
In this work, we propose to train CNNs from images annotated with multiple tags, to enhance the quality of visual representation of the trained CNN model.
5 code implementations • 10 Oct 2018 • Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, Yang Zheng, Haobo Fu, Tong Zhang, Ji Liu, Han Liu
Most existing deep reinforcement learning (DRL) frameworks consider either discrete action space or continuous action space solely.
1 code implementation • ICLR 2018 • Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Yang Zheng, Lei Han, Haobo Fu, Xiangru Lian, Carson Eisenach, Haichuan Yang, Emmanuel Ekwedike, Bei Peng, Haoyue Gao, Tong Zhang, Ji Liu, Han Liu
Most existing deep reinforcement learning (DRL) frameworks consider action spaces that are either discrete or continuous.
1 code implementation • 3 Nov 2020 • Bochao Wang, Hang Xu, Jiajin Zhang, Chen Chen, Xiaozhi Fang, Yixing Xu, Ning Kang, Lanqing Hong, Chenhan Jiang, Xinyue Cai, Jiawei Li, Fengwei Zhou, Yong Li, Zhicheng Liu, Xinghao Chen, Kai Han, Han Shu, Dehua Song, Yunhe Wang, Wei zhang, Chunjing Xu, Zhenguo Li, Wenzhi Liu, Tong Zhang
Automated Machine Learning (AutoML) is an important industrial solution for the automatic discovery and deployment of machine learning models.
1 code implementation • 23 May 2023 • Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang
Overall, our proposed paradigm and DetGPT demonstrate the potential for more sophisticated and intuitive interactions between humans and machines.
2 code implementations • 14 Jul 2021 • Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz, Satyen Kale, Sai Praneeth Karimireddy, Jakub Konecny, Sanmi Koyejo, Tian Li, Luyang Liu, Mehryar Mohri, Hang Qi, Sashank J. Reddi, Peter Richtarik, Karan Singhal, Virginia Smith, Mahdi Soltanolkotabi, Weikang Song, Ananda Theertha Suresh, Sebastian U. Stich, Ameet Talwalkar, Hongyi Wang, Blake Woodworth, Shanshan Wu, Felix X. Yu, Honglin Yuan, Manzil Zaheer, Mi Zhang, Tong Zhang, Chunxiang Zheng, Chen Zhu, Wennan Zhu
Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection.
7 code implementations • Findings of the Association for Computational Linguistics 2020 • Shizhe Diao, Jiaxin Bai, Yan Song, Tong Zhang, Yonggang Wang
Moreover, it is shown that reasonable performance can be obtained when ZEN is trained on a small corpus, which is important for applying pre-training techniques to scenarios with limited data.
Ranked #1 on Chinese Part-of-Speech Tagging on CTB5 Dev
Chinese Named Entity Recognition • Chinese Word Segmentation +5
1 code implementation • NeurIPS 2020 • Han Shi, Renjie Pi, Hang Xu, Zhenguo Li, James T. Kwok, Tong Zhang
In this work, we propose BONAS (Bayesian Optimized Neural Architecture Search), a sample-based NAS framework which is accelerated using weight-sharing to evaluate multiple related architectures simultaneously.
1 code implementation • 1 Feb 2023 • Bu Jin, Xinyu Liu, Yupeng Zheng, Pengfei Li, Hao Zhao, Tong Zhang, Yuhang Zheng, Guyue Zhou, Jingjing Liu
To bridge the gap, we propose an end-to-end transformer-based architecture, ADAPT (Action-aware Driving cAPtion Transformer), which provides user-friendly natural language narrations and reasoning for each decision making step of autonomous vehicular control and action.
5 code implementations • ICML 2018 • Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, Tamer Başar
To this end, we propose two decentralized actor-critic algorithms with function approximation, which are applicable to large-scale MARL problems where both the number of states and the number of agents are massively large.
Multi-agent Reinforcement Learning • Reinforcement Learning +1
2 code implementations • 1 Jan 2021 • Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li
While existing NAS methods mostly design architectures on one single task, algorithms that look beyond single-task search are surging to pursue a more efficient and universal solution across various tasks.
1 code implementation • 30 Nov 2022 • Rui Pan, Shizhe Diao, Jianlin Chen, Tong Zhang
In this paper, we present ExtremeBERT, a toolkit for accelerating and customizing BERT pretraining.
3 code implementations • NeurIPS 2017 • Pan Ji, Tong Zhang, Hongdong Li, Mathieu Salzmann, Ian Reid
We present a novel deep neural network architecture for unsupervised subspace clustering.
Ranked #3 on Image Clustering on Extended Yale-B
1 code implementation • 8 Mar 2022 • Di Chang, Aljaž Božič, Tong Zhang, Qingsong Yan, Yingcong Chen, Sabine Süsstrunk, Matthias Nießner
Finding accurate correspondences among different views is the Achilles' heel of unsupervised Multi-View Stereo (MVS).
1 code implementation • ACL 2017 • Rie Johnson, Tong Zhang
This paper proposes a low-complexity word-level deep convolutional neural network (CNN) architecture for text categorization that can efficiently represent long-range associations in text.
Ranked #2 on Sentiment Analysis on Amazon Review Full
2 code implementations • 23 Feb 2023 • Shizhe Diao, Pengcheng Wang, Yong Lin, Tong Zhang
For this purpose, we propose a solution to the key problem of determining which questions are the most important and helpful ones to annotate from a pool of task-specific queries.
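One common proxy for which questions are most worth annotating is disagreement among sampled answers. The sketch below illustrates that idea under the assumption that several answers have been sampled per question; the pool and its answers are hypothetical:

```python
def disagreement(answers):
    """Uncertainty proxy: fraction of distinct answers among the samples.
    1/len(answers) means full agreement; 1.0 means every sample differs."""
    return len(set(answers)) / len(answers)

def select_most_uncertain(question_answers, k=1):
    """Pick the k questions whose sampled answers disagree the most."""
    ranked = sorted(question_answers, key=lambda qa: disagreement(qa[1]),
                    reverse=True)
    return [q for q, _ in ranked[:k]]

# Hypothetical pool: each question paired with several sampled answers
# (e.g., multiple stochastic decodes from a language model).
pool = [
    ("q1", ["4", "4", "4", "4"]),    # consistent answers: low uncertainty
    ("q2", ["7", "9", "7", "12"]),   # disagreement: worth annotating
]
picked = select_most_uncertain(pool, k=1)
```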
1 code implementation • CVPR 2020 • Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Sadat Saleh, Tong Zhang, Nick Barnes
In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from the data labeling process.
Ranked #4 on RGB-D Salient Object Detection on LFSD
1 code implementation • ACL 2020 • Yuanhe Tian, Yan Song, Fei Xia, Tong Zhang, Yonggang Wang
Contextual features always play an important role in Chinese word segmentation (CWS).
Ranked #1 on Chinese Word Segmentation on CITYU
1 code implementation • 14 Nov 2023 • Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, Kashun Shum, Renjie Pi, Jipeng Zhang, Tong Zhang
Since the emergence of large language models, prompt learning has become a popular method for optimizing and customizing these models.
1 code implementation • CVPR 2023 • Yufan Ren, Fangjinhua Wang, Tong Zhang, Marc Pollefeys, Sabine Süsstrunk
The success of the Neural Radiance Fields (NeRF) in novel view synthesis has inspired researchers to propose neural implicit scene reconstruction.
1 code implementation • 8 Feb 2017 • Lingke Zeng, Xiangmin Xu, Bolun Cai, Suo Qiu, Tong Zhang
Crowd counting on static images is a challenging problem due to scale variations.
1 code implementation • 9 May 2019 • Baoyuan Wu, Li Shen, Tong Zhang, Bernard Ghanem
Thus, LS-LP is equivalent to the original MAP inference problem.
1 code implementation • 5 Sep 2011 • Rie Johnson, Tong Zhang
We consider the problem of learning a forest of nonlinear decision rules with general loss functions.
1 code implementation • ECCV 2018 • Yang Feng, Lin Ma, Wei Liu, Tong Zhang, Jiebo Luo
We first exploit and reorganize the videos in ActivityNet to form a new dataset for video re-localization research, which consists of about 10,000 videos of diverse visual appearances associated with localized boundary information.
1 code implementation • 27 Jun 2020 • Jason Ge, Xingguo Li, Haoming Jiang, Han Liu, Tong Zhang, Mengdi Wang, Tuo Zhao
We describe a new library named picasso, which implements a unified framework of pathwise coordinate optimization for a variety of sparse learning problems (e.g., sparse linear regression, sparse logistic regression, sparse Poisson regression and scaled sparse linear regression) combined with efficient active set selection strategies.
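In the plain lasso case, the coordinate optimization that such solvers build on reduces to cyclic coordinate descent with soft-thresholding. A minimal illustrative sketch follows (this is not the picasso implementation, which adds active-set selection and pathwise warm starts):

```python
def soft_threshold(z, t):
    """Soft-thresholding operator used in lasso coordinate updates."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0

def lasso_cd(X, y, lam, n_iter=200):
    """Cyclic coordinate descent for min (1/2n)*||y - Xw||^2 + lam*||w||_1.
    X is a list of rows; columns are assumed standardized for simplicity."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            # Partial residual excluding coordinate j.
            r = [y[i] - sum(X[i][k] * w[k] for k in range(p) if k != j)
                 for i in range(n)]
            rho = sum(X[i][j] * r[i] for i in range(n)) / n
            z = sum(X[i][j] ** 2 for i in range(n)) / n
            w[j] = soft_threshold(rho, lam) / z
    return w

# Demo on a toy design with orthogonal columns; the l1 penalty shrinks
# the true coefficient 2.0 to 1.8 (= soft_threshold(1.0, 0.1) / 0.5).
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
y = [2.0, 0.0, 2.0, 0.0]
w = lasso_cd(X, y, lam=0.1)
```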
1 code implementation • ACL 2020 • Yuanhe Tian, Yan Song, Xiang Ao, Fei Xia, Xiaojun Quan, Tong Zhang, Yonggang Wang
Chinese word segmentation (CWS) and part-of-speech (POS) tagging are important fundamental tasks for Chinese language processing, where joint learning of them is an effective one-step solution for both tasks.
1 code implementation • CVPR 2021 • Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang
We present a new method for few-shot human motion transfer that achieves realistic human image generation with only a small number of appearance inputs.
3 code implementations • 19 Sep 2018 • Peng Sun, Xinghai Sun, Lei Han, Jiechao Xiong, Qing Wang, Bo Li, Yang Zheng, Ji Liu, Yongsheng Liu, Han Liu, Tong Zhang
Both TStarBot1 and TStarBot2 are able to defeat the built-in AI agents from level 1 to level 10 in a full game (1v1 Zerg-vs-Zerg game on the AbyssalReef map), noting that level 8, level 9, and level 10 are cheating agents with unfair advantages such as full vision on the whole map and resource harvest boosting.
1 code implementation • ICLR 2019 • Meng Fang, Cheng Zhou, Bei Shi, Boqing Gong, Jia Xu, Tong Zhang
Dealing with sparse rewards is one of the most important challenges in reinforcement learning (RL), especially when a goal is dynamic (e.g., to grasp a moving object).
1 code implementation • 16 Aug 2023 • Jianyu Wen, Chenhao Wu, Tong Zhang, Yixuan Yu, Piotr Swierczynski
In this paper, we propose a 2-stage low-light image enhancement method called Self-Reference Deep Adaptive Curve Estimation (Self-DACE).
1 code implementation • 11 Nov 2023 • Haoyu Ma, Tong Zhang, Shanlin Sun, Xiangyi Yan, Kun Han, Xiaohui Xie
Reconstructing personalized animatable head avatars has significant implications in the fields of AR/VR.
1 code implementation • 31 Dec 2023 • Yuanhao Wu, Juno Zhu, Siliang Xu, Kashun Shum, Cheng Niu, Randy Zhong, Juntong Song, Tong Zhang
Retrieval-augmented generation (RAG) has become a main technique for alleviating hallucinations in large language models (LLMs).
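The retrieve-then-generate loop at the heart of RAG can be illustrated with a toy lexical retriever; the document set and prompt format below are hypothetical, and a real system would pass the assembled prompt to an LLM:

```python
def retrieve(query, docs, k=1):
    """Toy lexical retriever: rank documents by word overlap with the query."""
    q = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(q & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def build_prompt(query, docs):
    """Prepend the top retrieved document as evidence for the generator."""
    context = retrieve(query, docs, k=1)[0]
    return f"Context: {context}\nQuestion: {query}"

docs = [
    "The Eiffel Tower is in Paris.",
    "Mount Fuji is in Japan.",
]
prompt = build_prompt("where is the eiffel tower", docs)
# A real RAG system would now generate an answer conditioned on `prompt`,
# grounding the response in the retrieved evidence.
```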
1 code implementation • ICLR 2020 • Zhichao Huang, Tong Zhang
We present a new method for black-box adversarial attack.
1 code implementation • 16 Nov 2023 • Hanning Zhang, Shizhe Diao, Yong Lin, Yi R. Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, Tong Zhang
This approach is formalized by first identifying the knowledge gap between parametric knowledge and the instruction tuning data.
1 code implementation • 6 Oct 2020 • Xinwei Shen, Furui Liu, Hanze Dong, Qing Lian, Zhitang Chen, Tong Zhang
This paper proposes a Disentangled gEnerative cAusal Representation (DEAR) learning method under appropriate supervised information.
1 code implementation • 21 Jan 2022 • Shizhe Diao, Zhichao Huang, Ruijia Xu, Xuechun Li, Yong Lin, Xiao Zhou, Tong Zhang
Particularly, instead of fine-tuning the model in the cloud, we adapt PLMs by prompt learning, which efficiently optimizes only a few parameters of the discrete prompts.
1 code implementation • 7 Jun 2020 • Zhenbo Song, Jianfeng Lu, Tong Zhang, Hongdong Li
In this paper, we propose a monocular camera-based inter-vehicle distance and relative velocity estimation method based on end-to-end training of a deep neural network.
2 code implementations • CVPR 2021 • Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, Yuchao Dai
Visual salient object detection (SOD) aims at finding the salient object(s) that attract human attention, while camouflaged object detection (COD), on the contrary, intends to discover the camouflaged object(s) hidden in the surroundings.
1 code implementation • 1 May 2019 • Yandong Li, Lijun Li, Liqiang Wang, Tong Zhang, Boqing Gong
Powerful adversarial attack methods are vital for understanding how to construct robust deep neural networks (DNNs) and for thoroughly testing defense techniques.
1 code implementation • 10 Jan 2018 • Long-Yue Wang, Zhaopeng Tu, Shuming Shi, Tong Zhang, Yvette Graham, Qun Liu
Next, the annotated source sentence is reconstructed from hidden representations in the NMT model.
1 code implementation • TACL 2018 • Zhaopeng Tu, Yang Liu, Shuming Shi, Tong Zhang
Existing neural machine translation (NMT) models generally translate sentences in isolation, missing the opportunity to take advantage of document-level information.
1 code implementation • 8 Jan 2020 • Mingyi Liu, Zhiying Tu, Tong Zhang, Tonghua Su, Zhongjie Wang
In this paper, we first examine traditional active learning strategies in a specific case of BiLSTM-CRF, which has been widely used for named entity recognition, on several typical datasets.
1 code implementation • 8 Jun 2023 • Shizhe Diao, Tianyang Xu, Ruijia Xu, Jiawei Wang, Tong Zhang
Pre-trained language models (PLMs) demonstrate excellent abilities to understand texts in the generic domain while struggling in a specific domain.
1 code implementation • CVPR 2020 • Qi Chang, Hui Qu, Yikai Zhang, Mert Sabuncu, Chao Chen, Tong Zhang, Dimitris Metaxas
In this paper, we propose a data privacy-preserving and communication efficient distributed GAN learning framework named Distributed Asynchronized Discriminator GAN (AsynDGAN).
1 code implementation • CVPR 2023 • Yanhao Wu, Tong Zhang, Wei Ke, Sabine Süsstrunk, Mathieu Salzmann
In this paper, we introduce an SSL strategy that leverages positive pairs in both the spatial and temporal domain.
1 code implementation • CVPR 2020 • Chaoyang He, Haishan Ye, Li Shen, Tong Zhang
To remedy this, this paper proposes MiLeNAS, a mixed-level reformulation for NAS that can be optimized efficiently and reliably.
1 code implementation • NeurIPS 2019 • Qing Wang, Yingru Li, Jiechao Xiong, Tong Zhang
In deep reinforcement learning, policy optimization methods need to deal with issues such as function approximation and the reuse of off-policy data.
1 code implementation • 24 Jan 2023 • Xiao Zhou, Yong Lin, Renjie Pi, Weizhong Zhang, Renzhe Xu, Peng Cui, Tong Zhang
The overfitting issue is addressed by considering a bilevel formulation to search for the sample reweighting, in which the generalization complexity depends on the search space of sample weights instead of the model size.
3 code implementations • 27 Feb 2020 • Quan Tang, Fagui Liu, Tong Zhang, Jun Jiang, Yu Zhang
The way features propagate in Fully Convolutional Networks is of momentous importance to capture multi-scale contexts for obtaining precise segmentation masks.
Ranked #23 on Semantic Segmentation on SUN-RGBD (using extra training data)
2 code implementations • 19 Oct 2023 • Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang
Offline reinforcement learning (RL) presents a promising approach for learning reinforced policies from offline datasets without the need for costly or unsafe interactions with the environment.
1 code implementation • 14 Jan 2022 • Mengyue Zha, SiuTim Wong, Mengqi Liu, Tong Zhang, Kani Chen
This paper shows that masked autoencoder with extrapolator (ExtraMAE) is a scalable self-supervised model for time series generation.
1 code implementation • 18 Apr 2023 • Wentao Zhang, Yujun Huang, Tong Zhang, Qingsong Zou, Wei-Shi Zheng, Ruixuan Wang
In particular, updating an intelligent diagnosis system with training data of new diseases would cause catastrophic forgetting of old disease knowledge.
1 code implementation • 4 May 2021 • Yan Song, Tong Zhang, Yonggang Wang, Kai-Fu Lee
Pre-trained text encoders have drawn sustaining attention in natural language processing (NLP) and shown their capability in obtaining promising results in different tasks.
1 code implementation • ACL 2019 • Miaofeng Liu, Yan Song, Hongbin Zou, Tong Zhang
Supervised models suffer from the problem of domain shifting where distribution mismatch in the data across domains greatly affect model performance.
2 code implementations • 7 Oct 2020 • Yuhui Ding, Quanming Yao, Huan Zhao, Tong Zhang
Specifically, we search for a meta graph, which can capture more complex semantic relations than a meta path, to determine how graph neural networks (GNNs) propagate messages along different types of edges.
1 code implementation • CVPR 2021 • Xiao Zhou, Weizhong Zhang, Hang Xu, Tong Zhang
Weight pruning is an effective technique to reduce the model size and inference time for deep neural networks in real-world deployments.
1 code implementation • 11 Jul 2019 • Yujiao Shi, Xin Yu, Liu Liu, Tong Zhang, Hongdong Li
This paper proposes a novel Cross-View Feature Transport (CVFT) technique to explicitly establish cross-view domain transfer that facilitates feature alignment between ground and aerial images.
1 code implementation • 28 Feb 2024 • Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang
Additionally, DPA models user preferences as directions (i.e., unit vectors) in the reward space to achieve user-dependent preference control.
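The idea of a preference expressed as a unit vector in a multi-dimensional reward space can be illustrated as a simple projection. The two reward dimensions below are hypothetical examples, not the paper's actual setup:

```python
import math

def unit(v):
    """Normalize a vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def directional_reward(multi_rewards, preference_direction):
    """Score a multi-objective reward vector by projecting it onto a
    user-chosen preference direction (normalized to a unit vector)."""
    d = unit(preference_direction)
    return sum(r * w for r, w in zip(multi_rewards, d))

# Hypothetical two attributes: (helpfulness, verbosity).
concise = (0.9, 0.1)
verbose = (0.7, 0.9)
# A user who cares only about helpfulness ranks the concise answer higher;
# a different direction would express a different trade-off.
helpfulness_only = directional_reward(concise, [1.0, 0.0])
```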
1 code implementation • 21 Apr 2020 • Shizhe Diao, Yan Song, Tong Zhang
Keyphrase generation aims to produce a set of phrases summarizing the essentials of a given document.
1 code implementation • 5 Jan 2024 • Renjie Pi, Tianyang Han, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang
The deployment of multimodal large language models (MLLMs) has brought forth a unique vulnerability: susceptibility to malicious attacks through visual inputs.
1 code implementation • 25 Apr 2020 • Onur Barut, Yan Luo, Tong Zhang, Weigang Li, Peilong Li
Classifying network traffic is the basis for important network applications.
1 code implementation • 5 Jun 2021 • Onur Barut, Yan Luo, Tong Zhang, Weigang Li, Peilong Li
Classifying network traffic is the basis for important network applications.
2 code implementations • CVPR 2021 • Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li
While existing NAS methods mostly design architectures on a single task, algorithms that look beyond single-task search are surging to pursue a more efficient and universal solution across various tasks.
1 code implementation • 27 Dec 2020 • Cong Fang, Hanze Dong, Tong Zhang
Deep learning has received considerable empirical successes in recent years.
1 code implementation • ACL 2021 • Shizhe Diao, Ruijia Xu, Hongjin Su, Yilei Jiang, Yan Song, Tong Zhang
In this paper, we aim to adapt a generic pretrained model with a relatively small amount of domain-specific data.
2 code implementations • 24 Feb 2023 • Kashun Shum, Shizhe Diao, Tong Zhang
However, most CoT studies rely on carefully designed human-annotated rationale chains to prompt LLMs, posing challenges for real-world applications where labeled data is available without rationale chains.
1 code implementation • 26 Sep 2019 • Guilin Li, Xing Zhang, Zitong Wang, Matthias Tan, Jiashi Feng, Zhenguo Li, Tong Zhang
Recently, the efficiency of automatic neural architecture design has been significantly improved by gradient-based search methods such as DARTS.
1 code implementation • 18 Jun 2023 • Yifan Zhao, Tong Zhang, Jia Li, Yonghong Tian
Recent progress in this setting assumes that the base knowledge and novel query samples are distributed in the same domains, which are usually infeasible for realistic applications.
1 code implementation • 30 Jan 2024 • Wei zhang, Miaoxin Cai, Tong Zhang, Yin Zhuang, Xuerui Mao
Multi-modal large language models (MLLMs) have demonstrated remarkable success in vision and visual-language tasks within the natural image domain.
1 code implementation • 24 Nov 2021 • Ehsan Pajouheshgar, Tong Zhang, Sabine Süsstrunk
Generative Adversarial Network (GAN) based localized image editing can suffer from ambiguity between semantic attributes.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Yuanhe Tian, Yan Song, Fei Xia, Tong Zhang
Constituency parsing is a fundamental and important task for natural language understanding, where a good representation of contextual information can help this task.
Ranked #1 on Constituency Parsing on ATB
1 code implementation • 8 Jul 2022 • Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei zhang, He Chen
Currently, under supervised learning, a model pretrained on a large-scale natural scene dataset and then fine-tuned on a small amount of task-specific labeled data is the paradigm that has dominated knowledge transfer learning.
1 code implementation • 3 Nov 2021 • Renzhe Xu, Xingxuan Zhang, Zheyan Shen, Tong Zhang, Peng Cui
Afterward, we prove that under ideal conditions, independence-driven importance weighting algorithms could identify the variables in this set.
1 code implementation • 14 Oct 2022 • Ying Su, ZiHao Wang, Tianqing Fang, Hongming Zhang, Yangqiu Song, Tong Zhang
Commonsense reasoning tasks such as commonsense knowledge graph completion and commonsense question answering require powerful representation learning.
1 code implementation • 5 Jan 2023 • Bahar Aydemir, Ludo Hoffstetter, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk
Deep saliency prediction algorithms complement the object recognition features; they typically rely on additional information, such as scene context, semantic relationships, gaze direction, and object dissimilarity.
1 code implementation • CVPR 2023 • Bahar Aydemir, Ludo Hoffstetter, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk
Deep saliency prediction algorithms complement the object recognition features; they typically rely on additional information such as scene context, semantic relationships, gaze direction, and object dissimilarity.
Ranked #3 on Saliency Prediction on SALECI
1 code implementation • ICLR 2022 • Ziniu Li, Yingru Li, Yushun Zhang, Tong Zhang, Zhi-Quan Luo
However, it is limited to the case where 1) a good feature is known in advance and 2) this feature is fixed during the training: if otherwise, RLSVI suffers an unbearable computational burden to obtain the posterior samples of the parameter in the $Q$-value function.
1 code implementation • 21 Nov 2022 • Hanze Dong, Shizhe Diao, Weizhong Zhang, Tong Zhang
The resulting method is significantly more powerful than the standard normalization flow approach for generating data distributions with multiple modes.
1 code implementation • 30 May 2023 • Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang
In this paper, we study out-of-distribution (OOD) generalization of offline GCRL both theoretically and empirically to identify factors that are important.
1 code implementation • 12 Aug 2022 • Zhengcen Li, Yueran Li, Linlin Tang, Tong Zhang, Jingyong Su
To overcome the above shortcoming, we introduce a novel unified two-person graph to represent inter-body and intra-body correlations between joints.
1 code implementation • COLING 2022 • Kaixin Wu, Yue Zhang, Bojie Hu, Tong Zhang
Extensive experiments on ten WMT machine translation tasks show that the proposed model yields an average 1.35x speedup (with almost no decrease in BLEU) over the state-of-the-art inference implementation.
1 code implementation • 30 Nov 2021 • Bang Yang, Tong Zhang, Yuexian Zou
DCD is an auxiliary task that requires a caption model to learn the correspondence between video content and concepts and the co-occurrence relations between concepts.
Ranked #16 on Video Captioning on MSR-VTT
1 code implementation • 11 Sep 2023 • Yide Qiu, Shaoxiang Ling, Tong Zhang, Bo Huang, Zhen Cui
To perform effective learning on the large-scale UniKG, two key measures are taken: (i) a semantic alignment strategy for multi-attribute entities, which projects the feature descriptions of multi-attribute nodes into a common embedding space to facilitate node aggregation in a large receptive field; and (ii) a novel plug-and-play anisotropy propagation module (APM) that learns effective multi-hop anisotropy propagation kernels, extending methods for large-scale homogeneous graphs to heterogeneous graphs.
1 code implementation • NeurIPS 2021 • Xiao Zhou, Weizhong Zhang, Zonghao Chen, Shizhe Diao, Tong Zhang
For the latter step, instead of using the chain rule based gradient estimators as in existing methods, we propose a variance reduced policy gradient estimator, which only requires two forward passes without backward propagation, thus achieving completely sparse training.
1 code implementation • 20 Feb 2023 • Shizhe Diao, Sedrick Scott Keh, Liangming Pan, Zhiliang Tian, Yan Song, Tong Zhang
Social media classification tasks (e.g., tweet sentiment analysis, tweet stance detection) are challenging because social media posts are typically short, informal, and ambiguous.
1 code implementation • 14 Dec 2023 • Jiaqi Tang, Hao Lu, Xiaogang Xu, Ruizheng Wu, Sixing Hu, Tong Zhang, Tsz Wa Cheng, Ming Ge, Ying-Cong Chen, Fugee Tsung
Artificial Intelligence (AI)-driven defect inspection is pivotal in industrial manufacturing.
1 code implementation • 6 Feb 2024 • Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang
In this paper, we identify a typical class of inputs that baffles MLLMs, which consist of images that are highly relevant but inconsistent with answers, causing MLLMs to suffer from hallucination.
1 code implementation • 20 Mar 2024 • Chen Zhao, Tong Zhang, Zheng Dang, Mathieu Salzmann
Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation.
1 code implementation • NeurIPS 2018 • Jianfei Chen, Jun Zhu, Yee Whye Teh, Tong Zhang
However, sEM has a slower asymptotic convergence rate than batch EM, and requires a decreasing sequence of step sizes, which is difficult to tune.
2 code implementations • 21 Feb 2020 • Xinwei Shen, Tong Zhang, Kani Chen
This paper considers the general $f$-divergence formulation of bidirectional generative modeling, which includes VAE and BiGAN as special cases.
1 code implementation • ACL 2022 • Ying Su, Hongming Zhang, Yangqiu Song, Tong Zhang
However, the imbalanced training dataset leads to poor performance on rare senses and zero-shot senses.
1 code implementation • 30 Dec 2013 • Ohad Shamir, Nathan Srebro, Tong Zhang
We present a novel Newton-type method for distributed optimization, which is particularly well suited for stochastic optimization and learning problems.
1 code implementation • NeurIPS 2020 • Guilin Li, Junlei Zhang, Yunhe Wang, Chuanjian Liu, Matthias Tan, Yunfeng Lin, Wei zhang, Jiashi Feng, Tong Zhang
In particular, we propose a novel joint-training framework to train plain CNN by leveraging the gradients of the ResNet counterpart.
1 code implementation • 8 Apr 2021 • Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang, Seungryong Kim, Mathieu Salzmann, Sabine Süsstrunk
Saliency prediction has made great strides over the past two decades, with current techniques modeling low-level information, such as color, intensity and size contrasts, and high-level ones, such as attention and gaze direction for entire objects.
1 code implementation • NeurIPS 2023 • Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang
Notably, under the assumption of single policy coverage and the knowledge of $\zeta$, our proposed algorithm achieves a suboptimality bound that is worsened by an additive factor of $\mathcal{O}(\zeta (C(\widehat{\mathcal{F}},\mu)n)^{-1})$ due to the corruption.
1 code implementation • ICLR 2022 • Rui Pan, Haishan Ye, Tong Zhang
In this paper, we propose Eigencurve, the first family of learning rate schedules that can achieve minimax optimal convergence rates (up to a constant) for SGD on quadratic objectives when the eigenvalue distribution of the underlying Hessian matrix is skewed.
no code implementations • ICML 2018 • Rie Johnson, Tong Zhang
This paper first presents a theory for generative adversarial methods that does not rely on the traditional minimax formulation.
no code implementations • ICML 2018 • Weizhong Zhang, Bin Hong, Lin Ma, Wei Liu, Tong Zhang
Relying on this study, we subsequently propose a novel safe screening method to quickly identify the elements guaranteed to be included (we refer to them as active) or excluded (inactive) in the final optimal solution of SFM during the optimization process.
no code implementations • ICML 2018 • Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang
We study active object tracking, where a tracker takes as input the visual observation (i.e., frame sequence) and produces the camera control signal (e.g., move forward, turn left, etc.).
no code implementations • NeurIPS 2018 • Hanlin Tang, Shaoduo Gan, Ce Zhang, Tong Zhang, Ji Liu
In this paper, we explore a natural question: can the combination of both techniques lead to a system that is robust to both bandwidth and latency?
1 code implementation • EMNLP 2018 • Yi Liao, Lidong Bing, Piji Li, Shuming Shi, Wai Lam, Tong Zhang
For example, an input sequence could be a word sequence, such as review sentence and advertisement text.
no code implementations • 16 Apr 2018 • Jiatao Jiang, Chunyan Xu, Zhen Cui, Tong Zhang, Wenming Zheng, Jian Yang
As an analogy to a standard convolution kernel on image, Gaussian models implicitly coordinate those unordered vertices/nodes and edges in a local receptive field after projecting to the gradient space of Gaussian parameters.
no code implementations • 4 Apr 2018 • Xinpeng Chen, Jingyuan Chen, Lin Ma, Jian Yao, Wei Liu, Jiebo Luo, Tong Zhang
First, we demonstrate that video attractiveness and different engagements present different relationships.
no code implementations • CVPR 2018 • Jing Zhang, Tong Zhang, Yuchao Dai, Mehrtash Harandi, Richard Hartley
Such supervision, while labor-intensive and not always possible, tends to hinder the generalization ability of the learned models.
no code implementations • 27 Mar 2018 • Tong Zhang, Wenming Zheng, Zhen Cui, Yang Li
For cross graph convolution, a parameterized Kronecker sum operation is proposed to generate a conjunctive adjacency matrix characterizing the relationship between every pair of nodes across two subgraphs.
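The Kronecker sum itself is a standard construction: for adjacency matrices $A$ and $B$, $A \oplus B = A \otimes I + I \otimes B$ connects every pair of nodes across the two subgraphs. A minimal, unparameterized sketch (the paper's version adds learnable parameters):

```python
# Sketch: Kronecker sum of two adjacency matrices, the standard way to
# build a conjunctive adjacency over node pairs across two graphs.
# Illustrative only; the paper's operation is parameterized.

def kron(A, B):
    """Kronecker product of two square matrices given as lists of lists."""
    n, m = len(A), len(B)
    return [[A[i // m][j // m] * B[i % m][j % m]
             for j in range(n * m)] for i in range(n * m)]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def kron_sum(A, B):
    """A ⊕ B = A ⊗ I_m + I_n ⊗ B."""
    n, m = len(A), len(B)
    AI = kron(A, identity(m))
    IB = kron(identity(n), B)
    return [[AI[i][j] + IB[i][j] for j in range(n * m)] for i in range(n * m)]

# Two tiny graphs, each a single edge between 2 nodes.
A = [[0, 1], [1, 0]]
B = [[0, 1], [1, 0]]
C = kron_sum(A, B)   # 4x4 adjacency over pairs of nodes
```

Node pair (a0, b0) is adjacent to (a0, b1) through the $I \otimes B$ term and to (a1, b0) through the $A \otimes I$ term, but not to (a1, b1).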
no code implementations • ECCV 2018 • Xinyu Gong, HaoZhi Huang, Lin Ma, Fumin Shen, Wei Liu, Tong Zhang
While each view of the stereoscopic pair is processed in an individual path, a novel feature aggregation strategy is proposed to effectively share information between the two paths.
no code implementations • 19 Jun 2017 • Xingguo Li, Lin F. Yang, Jason Ge, Jarvis Haupt, Tong Zhang, Tuo Zhao
We propose a DC proximal Newton algorithm for solving nonconvex regularized sparse learning problems in high dimensions.
no code implementations • 15 May 2017 • Luo Luo, Cheng Chen, Zhihua Zhang, Wu-Jun Li, Tong Zhang
We also apply RFD to online learning and propose an effective hyperparameter-free online Newton algorithm.
no code implementations • ICML 2018 • Lei Han, Yiheng Huang, Tong Zhang
This paper proposes a method for multi-class classification problems, where the number of classes K is large.
no code implementations • NeurIPS 2018 • Jianqiao Wangni, Jialei Wang, Ji Liu, Tong Zhang
Modern large scale machine learning applications require stochastic optimization algorithms to be implemented on distributed computational architectures.
no code implementations • 21 Jun 2017 • Jialei Wang, Tong Zhang
We present novel minibatch stochastic optimization methods for empirical risk minimization problems; the methods efficiently leverage variance-reduced first-order and sub-sampled higher-order information to accelerate convergence.
no code implementations • 16 Mar 2016 • Chris Junchi Li, Mengdi Wang, Han Liu, Tong Zhang
We prove for the first time a nearly optimal finite-sample error bound for the online PCA algorithm.
no code implementations • 30 May 2017 • Tong Zhang, Wenming Zheng, Zhen Cui, Chaolong Li
Symmetric positive definite (SPD) matrices (e.g., covariances, graph Laplacians, etc.)
no code implementations • 13 Apr 2016 • Shun Zheng, Jialei Wang, Fen Xia, Wei Xu, Tong Zhang
In modern large-scale machine learning applications, the training data are often partitioned and stored on multiple machines.
no code implementations • 4 Jun 2017 • Qiang Sun, Kean Ming Tan, Han Liu, Tong Zhang
Our proposal is computationally tractable and produces an estimator that achieves the oracle rate of convergence.
no code implementations • 12 May 2017 • Tong Zhang, Wenming Zheng, Zhen Cui, Yuan Zong, Yang Li
Then a bi-directional temporal RNN layer is further used to learn discriminative temporal dependencies from the sequences concatenating spatial features of each time slice produced from the spatial RNN layer.
no code implementations • 23 Dec 2014 • Tuo Zhao, Han Liu, Tong Zhang
This is the first result on the computational and statistical guarantees of the pathwise coordinate optimization framework in high dimensions.
no code implementations • 29 Mar 2015 • Shusen Wang, Zhihua Zhang, Tong Zhang
The Nyström method is a special instance of our fast model and is an approximation to the prototype model.
no code implementations • 31 Aug 2016 • Rie Johnson, Tong Zhang
This paper reports the performances of shallow word-level convolutional neural networks (CNN), our earlier work (2015), on the eight datasets with relatively large training data that were used for testing the very deep character-level CNN in Conneau et al. (2016).
no code implementations • 27 Apr 2016 • Lei Han, Kean Ming Tan, Ting Yang, Tong Zhang
A major challenge for building statistical models in the big data era is that the available data volume far exceeds the computational capability.
no code implementations • 7 Feb 2016 • Rie Johnson, Tong Zhang
The best results were obtained by combining region embeddings in the form of LSTM and convolution layers trained on unlabeled data.
Ranked #1 on Text Classification on RCV1
no code implementations • ICML 2017 • Jialei Wang, Mladen Kolar, Nathan Srebro, Tong Zhang
We propose a novel, efficient approach for distributed sparse learning in high-dimensions, where observations are randomly partitioned across machines.
no code implementations • 29 Apr 2016 • Kean Ming Tan, Zhaoran Wang, Han Liu, Tong Zhang
Sparse generalized eigenvalue problem (GEP) plays a pivotal role in a large family of high-dimensional statistical models, including sparse Fisher's discriminant analysis, canonical correlation analysis, and sufficient dimension reduction.
no code implementations • 13 Dec 2012 • Sivan Sabato, Shai Shalev-Shwartz, Nathan Srebro, Daniel Hsu, Tong Zhang
We consider the problem of learning a non-negative linear classifier with a $1$-norm of at most $k$, and a fixed threshold, under the hinge-loss.
no code implementations • 14 Nov 2015 • Zhuoran Yang, Zhaoran Wang, Han Liu, Yonina C. Eldar, Tong Zhang
To recover $\beta^*$, we propose an $\ell_1$-regularized least-squares estimator.
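As a generic illustration of an $\ell_1$-regularized least-squares estimator (not the paper's specific analysis), a minimal ISTA solver using proximal gradient steps with soft-thresholding; data and parameters below are hypothetical:

```python
# Minimal ISTA sketch for l1-regularized least squares:
# min_beta (1/2n) ||X beta - y||^2 + lam * ||beta||_1

def soft_threshold(z, t):
    """Proximal operator of t * |.|: shrink z toward zero by t."""
    return max(z - t, 0.0) - max(-z - t, 0.0)

def ista(X, y, lam, step, iters=200):
    n, d = len(X), len(X[0])
    beta = [0.0] * d
    for _ in range(iters):
        # residuals r = X beta - y
        r = [sum(X[i][j] * beta[j] for j in range(d)) - y[i] for i in range(n)]
        # gradient of the smooth part (1/2n) ||X beta - y||^2
        g = [sum(X[i][j] * r[i] for i in range(n)) / n for j in range(d)]
        beta = [soft_threshold(beta[j] - step * g[j], step * lam)
                for j in range(d)]
    return beta

# Toy data: y depends only on the first coordinate, so the second
# coefficient should be shrunk exactly to zero.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
y = [2.0, 0.0, 2.0, 0.0]
beta = ista(X, y, lam=0.1, step=0.5)
```

On this toy problem the active coefficient converges to $2 - 0.2 = 1.8$ (the least-squares value minus the $\ell_1$ shrinkage) and the inactive one stays exactly zero.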
no code implementations • NeurIPS 2015 • Rie Johnson, Tong Zhang
This paper presents a new semi-supervised framework with convolutional neural networks (CNNs) for text categorization.
no code implementations • 26 Dec 2014 • Shusen Wang, Tong Zhang, Zhihua Zhang
Low-rank matrix completion is an important problem with extensive real-world applications.
no code implementations • 20 Jun 2013 • Zhaoran Wang, Han Liu, Tong Zhang
In particular, our analysis improves upon existing results by providing a more refined sample complexity bound as well as an exact support recovery result for the final estimator.
no code implementations • 13 Jan 2014 • Peilin Zhao, Tong Zhang
Uniform sampling of training data has been commonly used in traditional stochastic optimization algorithms such as Proximal Stochastic Gradient Descent (prox-SGD) and Proximal Stochastic Dual Coordinate Ascent (prox-SDCA).
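The non-uniform alternative studied in this line of work draws component $i$ with probability proportional to its smoothness constant $L_i$ and reweights the gradient by $1/(n p_i)$ so the step stays unbiased. A toy sketch with a hypothetical separable objective:

```python
import random

# Importance-sampling SGD sketch on f(w) = (1/n) sum_i (L_i / 2) w^2.
# Component i is drawn with probability p_i ∝ L_i; the gradient is
# rescaled by 1/(n * p_i) to keep the update unbiased. Illustrative only.

def importance_sgd(L, lr=0.1, steps=20, w0=1.0, seed=0):
    rng = random.Random(seed)
    n = len(L)
    total = sum(L)
    p = [Li / total for Li in L]
    w = w0
    for _ in range(steps):
        # inverse-CDF draw of a component index
        u, i, acc = rng.random(), 0, p[0]
        while u > acc and i < n - 1:
            i += 1
            acc += p[i]
        g = L[i] * w                # gradient of f_i(w) = (L_i / 2) w^2
        w -= lr * g / (n * p[i])    # reweighted, unbiased step
    return w

w = importance_sgd([1.0, 9.0])      # minimizer of the average is w = 0
```

On this toy quadratic the reweighted gradient is the same for every drawn index, so the iterate contracts deterministically toward zero; that variance-killing effect is exactly what Lipschitz-proportional sampling is after.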
no code implementations • 21 Nov 2014 • Zheng Qu, Peter Richtárik, Tong Zhang
The distributed variant of Quartz is the first distributed SDCA-like method with an analysis for non-separable data.
no code implementations • 16 Dec 2013 • Peilin Zhao, Jinwei Yang, Tong Zhang, Ping Li
The Alternating Direction Method of Multipliers (ADMM) has been studied for years.
no code implementations • 13 May 2014 • Peilin Zhao, Tong Zhang
Stochastic Gradient Descent (SGD) is a popular optimization method which has been applied to many important machine learning tasks such as Support Vector Machines and Deep Neural Networks.
no code implementations • 13 Jun 2011 • Daniel Hsu, Sham M. Kakade, Tong Zhang
The analysis also reveals the effect of errors in the estimated covariance structure, as well as the effect of modeling errors, neither of which is present in the fixed design setting.
no code implementations • 19 Mar 2014 • Lin Xiao, Tong Zhang
We consider the problem of minimizing the sum of two convex functions: one is the average of a large number of smooth component functions, and the other is a general convex function that admits a simple proximal mapping.
no code implementations • 31 Dec 2013 • Ping Li, Cun-Hui Zhang, Tong Zhang
In this paper, we adopt very sparse Compressed Counting for nonnegative signal recovery.
no code implementations • 22 Nov 2013 • Xiao-Tong Yuan, Ping Li, Tong Zhang
Numerical evidence shows that our method is superior to the state-of-the-art greedy selection methods in sparse logistic regression and sparse precision matrix estimation tasks.
no code implementations • 21 Nov 2013 • Xiao-Tong Yuan, Ping Li, Tong Zhang
We investigate a generic problem of learning pairwise exponential family graphical models with pairwise sufficient statistics defined by a global mapping function, e.g., Mercer kernels.
no code implementations • 12 Nov 2013 • Dong Dai, Philippe Rigollet, Lucy Xia, Tong Zhang
While our results indicate that the same aggregation scheme may not satisfy sharp oracle inequalities with high probability, we prove a weaker notion of oracle inequality for EW that holds with high probability.
no code implementations • 10 Sep 2013 • Shai Shalev-Shwartz, Tong Zhang
We introduce a proximal version of the stochastic dual coordinate ascent method and show how to accelerate the method using an inner-outer iteration procedure.
no code implementations • 3 Oct 2013 • Ping Li, Cun-Hui Zhang, Tong Zhang
In particular, when $p \to 0$ the required number of measurements is essentially $M = K\log N$, where $K$ is the number of nonzero coordinates of the signal.
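A quick numeric check of this measurement count, assuming the natural logarithm:

```python
import math

# M = K * log(N): measurements needed to recover a K-sparse signal of
# length N as p -> 0 (natural log assumed here for illustration).
def measurements(K, N):
    return math.ceil(K * math.log(N))

m = measurements(10, 1_000_000)   # K = 10 nonzeros among N = 1e6 coordinates
```

So roughly 139 measurements suffice for a 10-sparse signal in a million dimensions, a dramatic compression over sampling all coordinates.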
no code implementations • 26 Sep 2013 • Krishnakumar Balasubramanian, Kai Yu, Tong Zhang
The traditional convex formulation employs the group Lasso relaxation to achieve joint sparsity across tasks.
no code implementations • NeurIPS 2013 • Shai Shalev-Shwartz, Tong Zhang
Stochastic dual coordinate ascent (SDCA) is an effective technique for solving regularized loss minimization problems in machine learning.
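For squared loss with $\ell_2$ regularization, the SDCA coordinate update has a closed form. A minimal ridge-regression sketch with illustrative data and parameters (not the paper's general loss setting):

```python
import random

# SDCA sketch for ridge regression:
#   min_w (1/n) sum_i (1/2)(w.x_i - y_i)^2 + (lam/2)||w||^2
# Each step updates one dual variable alpha_i in closed form and keeps
# the primal w = sum_i alpha_i x_i / (lam * n) in sync.

def sdca_ridge(X, y, lam, epochs=50, seed=0):
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    alpha = [0.0] * n
    w = [0.0] * d
    for _ in range(epochs * n):
        i = rng.randrange(n)
        xi = X[i]
        pred = sum(w[j] * xi[j] for j in range(d))
        sq = sum(v * v for v in xi)
        # closed-form coordinate update for squared loss
        delta = (y[i] - pred - alpha[i]) / (1.0 + sq / (lam * n))
        alpha[i] += delta
        for j in range(d):
            w[j] += delta * xi[j] / (lam * n)
    return w

# Toy problem whose ridge solution is w = 0.5.
w = sdca_ridge([[1.0], [1.0]], [1.0, 1.0], lam=1.0)
```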
no code implementations • ICML 2018 • Jiaxiang Wu, Weidong Huang, Junzhou Huang, Tong Zhang
Large-scale distributed optimization is of great importance in various applications.
no code implementations • NeurIPS 2018 • Cong Fang, Chris Junchi Li, Zhouchen Lin, Tong Zhang
For stochastic first-order methods, combining SPIDER with normalized gradient descent, we propose two new algorithms, namely SPIDER-SFO and SPIDER-SFO$^+$, that solve non-convex stochastic optimization problems using stochastic gradients only.
no code implementations • 7 Jul 2018 • Wenting Zhao, Chunyan Xu, Zhen Cui, Tong Zhang, Jiatao Jiang, Zhen-Yu Zhang, Jian Yang
In this paper, we aim to give a comprehensive analysis of when work matters by transforming different classical network structures to graph CNN, particularly in the basic graph recognition problem.
Ranked #3 on Graph Classification on IMDb-B
no code implementations • ECCV 2018 • Minjun Li, Hao-Zhi Huang, Lin Ma, Wei Liu, Tong Zhang, Yu-Gang Jiang
Recent studies on unsupervised image-to-image translation have made remarkable progress by training a pair of generative adversarial networks with a cycle-consistent loss.
no code implementations • ECCV 2018 • Wenhao Jiang, Lin Ma, Yu-Gang Jiang, Wei Liu, Tong Zhang
In this paper, in order to exploit the complementary information from multiple encoders, we propose a novel Recurrent Fusion Network (RFNet) for tackling image captioning.
no code implementations • 10 Aug 2018 • Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang
We further propose an environment augmentation technique and a customized reward function, which are crucial for successful training.
no code implementations • NeurIPS 2017 • Chris Junchi Li, Mengdi Wang, Han Liu, Tong Zhang
In this paper, we propose to adopt the diffusion approximation tools to study the dynamics of Oja's iteration which is an online stochastic gradient descent method for the principal component analysis.
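Oja's iteration itself is a one-line update: a stochastic gradient step in the direction $(x x^\top) w$ followed by renormalization. A toy 2-d sketch on a synthetic stream (step size and data are illustrative):

```python
import math
import random

# Oja's iteration for streaming PCA: w <- normalize(w + lr * (w.x) * x).
# The stream has variance 9 on the first axis and 0.09 on the second, so
# the iterate should align with the first coordinate axis.

def oja(samples, lr=0.05):
    w = [1.0 / math.sqrt(2.0)] * 2   # start at a 45-degree angle
    for x in samples:
        proj = w[0] * x[0] + w[1] * x[1]
        w = [w[0] + lr * proj * x[0], w[1] + lr * proj * x[1]]
        norm = math.sqrt(w[0] ** 2 + w[1] ** 2)
        w = [w[0] / norm, w[1] / norm]
    return w

rng = random.Random(1)
stream = [(rng.gauss(0, 3.0), rng.gauss(0, 0.3)) for _ in range(2000)]
w = oja(stream)
```

After the stream, $|w_0|$ should be close to 1: the iterate has rotated onto the top principal direction.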
no code implementations • 17 Sep 2018 • Kean Ming Tan, Zhaoran Wang, Tong Zhang, Han Liu, R. Dennis Cook
Sliced inverse regression is a popular tool for sufficient dimension reduction, which replaces covariates with a minimal set of their linear combinations without loss of information on the conditional distribution of the response given the covariates.
no code implementations • 25 Sep 2018 • Chaobing Song, Ji Liu, Han Liu, Yong Jiang, Tong Zhang
Regularized online learning is widely used in machine learning applications.
no code implementations • ECCV 2018 • Yonggen Ling, Linchao Bao, Zequn Jie, Fengming Zhu, Ziyang Li, Shanmin Tang, Yongsheng Liu, Wei Liu, Tong Zhang
Our approach is able to handle the rolling-shutter effects and imperfect sensor synchronization in a unified way.
no code implementations • ECCV 2018 • Yitong Wang, Dihong Gong, Zheng Zhou, Xing Ji, Hao Wang, Zhifeng Li, Wei Liu, Tong Zhang
Extensive experiments conducted on the three public domain face aging datasets (MORPH Album 2, CACD-VS and FG-NET) have shown the effectiveness of the proposed approach and the value of the constructed CAF dataset on AIFR.
Ranked #3 on Age-Invariant Face Recognition on MORPH Album2
no code implementations • EMNLP 2018 • Jian Li, Zhaopeng Tu, Baosong Yang, Michael R. Lyu, Tong Zhang
Multi-head attention is appealing for the ability to jointly attend to information from different representation subspaces at different positions.
no code implementations • EMNLP 2018 • Baosong Yang, Zhaopeng Tu, Derek F. Wong, Fandong Meng, Lidia S. Chao, Tong Zhang
Self-attention networks have proven to be of profound value for their strength of capturing global dependencies.
Ranked #29 on Machine Translation on WMT2014 English-German
no code implementations • EMNLP 2018 • Zi-Yi Dou, Zhaopeng Tu, Xing Wang, Shuming Shi, Tong Zhang
Advanced neural machine translation (NMT) models generally implement encoder and decoder as multiple layers, which allows systems to model complex functions and capture complicated linguistic structures.
no code implementations • 2 Nov 2018 • Shixiang Chen, Shiqian Ma, Anthony Man-Cho So, Tong Zhang
We prove that the proposed method globally converges to a stationary point.
no code implementations • NeurIPS 2018 • Conghui Tan, Tong Zhang, Shiqian Ma, Ji Liu
Regularized empirical risk minimization problems with linear predictors appear frequently in machine learning.
no code implementations • 2 Nov 2018 • Tong Zhang, Pan Ji, Mehrtash Harandi, Richard Hartley, Ian Reid
In this paper, we introduce a method that simultaneously learns an embedding space along with subspaces within it to minimize a notion of reconstruction error, thus addressing the problem of subspace clustering in an end-to-end learning paradigm.
no code implementations • ECCV 2018 • Kaipeng Zhang, Zhanpeng Zhang, Chia-Wen Cheng, Winston H. Hsu, Yu Qiao, Wei Liu, Tong Zhang
Face hallucination is a generative task to super-resolve the facial image with low resolution while human perception of face heavily relies on identity information.
no code implementations • 21 Nov 2018 • Xiang Kong, Zhaopeng Tu, Shuming Shi, Eduard Hovy, Tong Zhang
Although Neural Machine Translation (NMT) models have advanced state-of-the-art performance in machine translation, they face problems such as inadequate translation.
Ranked #35 on Machine Translation on WMT2014 English-German
no code implementations • 30 Nov 2018 • Keyu Yan, Wenming Zheng, Tong Zhang, Yuan Zong, Zhen Cui
Cross-database non-frontal expression recognition is a very meaningful but rather difficult subject in the fields of computer vision and affective computing.
Facial Expression Recognition (FER) +1
no code implementations • 6 Dec 2018 • Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, Tamer Başar
This work appears to be the first finite-sample analysis for batch MARL, a step towards rigorous theoretical understanding of general MARL algorithms in the finite-sample regime.
Multi-agent Reinforcement Learning, Reinforcement Learning +1
no code implementations • 19 Dec 2018 • Yuan Zong, Tong Zhang, Wenming Zheng, Xiaopeng Hong, Chuangao Tang, Zhen Cui, Guoying Zhao
Cross-database micro-expression recognition (CDMER) is a recently emerging and interesting problem in micro-expression analysis.
no code implementations • NeurIPS 2018 • Cong Fang, Chris Junchi Li, Zhouchen Lin, Tong Zhang
Specially, we prove that the SPIDER-SFO algorithm achieves a gradient computation cost of $\mathcal{O}\left( \min( n^{1/2} \epsilon^{-2}, \epsilon^{-3} ) \right)$ to find an $\epsilon$-approximate first-order stationary point.
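The SPIDER estimator behind these rates refreshes with a full gradient every $q$ steps and otherwise updates recursively via $v_k = \nabla f_i(x_k) - \nabla f_i(x_{k-1}) + v_{k-1}$. A toy finite-sum sketch (without the normalized-step refinement of SPIDER-SFO; problem and parameters are illustrative):

```python
import random

# SPIDER-style gradient estimator on a toy finite sum: periodic full
# gradient refresh, recursive stochastic correction in between.

def spider_sgd(grads, full_grad, x0, lr=0.1, q=5, steps=100, seed=0):
    rng = random.Random(seed)
    x, x_prev, v = x0, x0, 0.0
    for k in range(steps):
        if k % q == 0:
            v = full_grad(x)                 # periodic exact refresh
        else:
            i = rng.randrange(len(grads))    # recursive correction
            v = grads[i](x) - grads[i](x_prev) + v
        x_prev, x = x, x - lr * v
    return x

# Toy problem: f_i(x) = (x - c_i)^2 / 2, whose minimizer is mean(c) = 2.
c = [1.0, 2.0, 3.0]
grads = [lambda x, ci=ci: x - ci for ci in c]
full_grad = lambda x: sum(g(x) for g in grads) / len(c)
x_star = spider_sgd(grads, full_grad, x0=0.0)
```

On quadratics the gradient differences are exact, so the estimator tracks the full gradient perfectly; in general it tracks it up to a variance that the periodic refresh keeps controlled.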
no code implementations • NeurIPS 2017 • Xingguo Li, Lin Yang, Jason Ge, Jarvis Haupt, Tong Zhang, Tuo Zhao
We propose a DC proximal Newton algorithm for solving nonconvex regularized sparse learning problems in high dimensions.
no code implementations • NeurIPS 2017 • Wenbing Huang, Mehrtash Harandi, Tong Zhang, Lijie Fan, Fuchun Sun, Junzhou Huang
Linear Dynamical Systems (LDSs) are fundamental tools for modeling spatio-temporal data in various disciplines.
no code implementations • NeurIPS 2016 • Xiaotong Yuan, Ping Li, Tong Zhang
In this paper, we bridge this gap by showing, for the first time, that exact recovery of the global sparse minimizer is possible for HTP-style methods under restricted strong condition number bounding conditions.
no code implementations • NeurIPS 2016 • Xiaotong Yuan, Ping Li, Tong Zhang, Qingshan Liu, Guangcan Liu
We investigate a subclass of exponential family graphical models of which the sufficient statistics are defined by arbitrary additive forms.
no code implementations • NeurIPS 2015 • Zheng Qu, Peter Richtarik, Tong Zhang
We study the problem of minimizing the average of a large number of smooth convex functions penalized with a strongly convex regularizer.
no code implementations • NeurIPS 2015 • Daniel Vainsencher, Han Liu, Tong Zhang
We propose a family of non-uniform sampling strategies to provably speed up a class of stochastic optimization algorithms with linear convergence including Stochastic Variance Reduced Gradient (SVRG) and Stochastic Dual Coordinate Ascent (SDCA).
no code implementations • NeurIPS 2013 • Rie Johnson, Tong Zhang
Stochastic gradient descent is popular for large scale optimization but has slow convergence asymptotically due to the inherent variance.
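The variance-reduction fix proposed here (SVRG) centers each stochastic gradient at a periodic snapshot: the inner update uses $\nabla f_i(w) - \nabla f_i(\tilde w) + \nabla F(\tilde w)$, which is unbiased and has vanishing variance near the optimum. A minimal sketch on a toy finite sum (illustrative step size and epoch counts):

```python
import random

# SVRG sketch: outer loop takes a snapshot and a full gradient; inner
# loop takes variance-reduced stochastic steps.

def svrg(grads, w0, lr=0.1, epochs=20, inner=10, seed=0):
    rng = random.Random(seed)
    n, w = len(grads), w0
    for _ in range(epochs):
        snap = w
        mu = sum(g(snap) for g in grads) / n    # full gradient at snapshot
        for _ in range(inner):
            i = rng.randrange(n)
            w = w - lr * (grads[i](w) - grads[i](snap) + mu)
    return w

# Toy problem: f_i(w) = (w - c_i)^2 / 2, minimizer at mean(c) = 2.
c = [0.0, 4.0]
grads = [lambda w, ci=ci: w - ci for ci in c]
w = svrg(grads, w0=10.0)
```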
no code implementations • NeurIPS 2012 • Quanquan Gu, Tong Zhang, Jiawei Han, Chris H. Ding
In particular, we derive a deterministic generalization error bound for LapRLS trained on subsampled data, and propose to select a subset of data points to label by minimizing this upper bound.
no code implementations • NeurIPS 2011 • Dong Dai, Tong Zhang
The purpose of this paper is to present a new greedy model averaging procedure that improves EWMA.
no code implementations • NeurIPS 2011 • Zhen Li, Huazhong Ning, Liangliang Cao, Tong Zhang, Yihong Gong, Thomas S. Huang
Traditional approaches relied on algorithmic constructions that are often data independent (such as Locality Sensitive Hashing) or weakly dependent (such as kd-trees, k-means trees).
no code implementations • NeurIPS 2011 • Animashree Anandkumar, Kamalika Chaudhuri, Daniel J. Hsu, Sham M. Kakade, Le Song, Tong Zhang
The setting is one where we only have samples from certain observed variables in the tree, and our goal is to estimate the tree structure (i.e., the graph of how the underlying hidden variables are connected to each other and to the observed variables).
no code implementations • NeurIPS 2010 • Alina Beygelzimer, Daniel J. Hsu, John Langford, Tong Zhang
We present and analyze an agnostic active learning algorithm that works without keeping a version space.
no code implementations • NeurIPS 2010 • Yuanqing Lin, Tong Zhang, Shenghuo Zhu, Kai Yu
This paper proposes a principled extension of the traditional single-layer flat sparse coding scheme, where a two-layer coding scheme is derived based on theoretical analysis of nonlinear functional approximation that extends recent results for local coordinate coding.
no code implementations • NeurIPS 2009 • Kai Yu, Tong Zhang, Yihong Gong
This paper introduces a new method for semi-supervised learning on high dimensional nonlinear manifolds, which includes a phase of unsupervised basis learning and a phase of supervised function learning.
no code implementations • NeurIPS 2008 • John Langford, Lihong Li, Tong Zhang
We propose a general method called truncated gradient to induce sparsity in the weights of online-learning algorithms with convex loss.
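The truncation step itself can be sketched as a shrink-toward-zero of magnitude at most $g$, applied only to weights inside a window $[-\theta, \theta]$; constants below are illustrative, and the paper specifies the full gravity/threshold schedule:

```python
# Truncated-gradient sketch: after an ordinary online gradient step,
# small weights are pulled toward exact zero, inducing sparsity, while
# large weights are left untouched.

def truncate(w, g, theta):
    out = []
    for v in w:
        if 0.0 < v <= theta:
            out.append(max(0.0, v - g))     # shrink small positives to 0
        elif -theta <= v < 0.0:
            out.append(min(0.0, v + g))     # shrink small negatives to 0
        else:
            out.append(v)                   # leave large weights alone
    return out

w = truncate([0.03, -0.02, 1.5, -2.0], g=0.05, theta=0.1)
```

Here the two small weights are truncated to exact zeros while the two large ones pass through unchanged.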
no code implementations • NeurIPS 2008 • Tong Zhang
Consider linear prediction models where the target function is a sparse linear combination of a set of basis functions.
no code implementations • NeurIPS 2008 • Tong Zhang
We study learning formulations with non-convex regularization that are natural for sparse linear models.
no code implementations • NeurIPS 2007 • John Langford, Tong Zhang
We present Epoch-Greedy, an algorithm for multi-armed bandits with observable side information.
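The epoch structure can be sketched on a toy two-arm bandit with no side information (the actual algorithm learns a policy over contexts; all parameters here are illustrative): each epoch takes one uniform exploration step, then several greedy exploitation steps using the current estimates.

```python
import random

# Epoch-Greedy sketch: one exploration pull per epoch, followed by a
# block of greedy exploitation pulls on the empirically best arm.

def epoch_greedy(true_means, epochs=200, exploit_per_epoch=5, seed=0):
    rng = random.Random(seed)
    k = len(true_means)
    counts, sums = [0] * k, [0.0] * k

    def pull(a):
        return true_means[a] + rng.uniform(-0.1, 0.1)   # noisy reward

    for _ in range(epochs):
        a = rng.randrange(k)                 # exploration step
        counts[a] += 1
        sums[a] += pull(a)
        est = [sums[i] / counts[i] if counts[i] else 0.0 for i in range(k)]
        best = est.index(max(est))
        for _ in range(exploit_per_epoch):   # exploitation steps
            sums[best] += pull(best)
            counts[best] += 1
    return counts

counts = epoch_greedy([0.2, 0.8])            # arm 1 is the better arm
```

The better arm ends up pulled far more often, since exploitation concentrates on it once both arms have been explored.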
no code implementations • NeurIPS 2007 • Zhaohui Zheng, Hongyuan Zha, Tong Zhang, Olivier Chapelle, Keke Chen, Gordon Sun
We present a general boosting method extending functional gradient boosting to optimize complex loss functions that are encountered in many machine learning problems.
no code implementations • ICML 2017 • Wenpeng Zhang, Peilin Zhao, Wenwu Zhu, Steven C. H. Hoi, Tong Zhang
The conditional gradient algorithm has regained a surge of research interest in recent years due to its high efficiency in handling large-scale machine learning problems.
no code implementations • ICML 2018 • Qiang Sun, Kean Ming Tan, Han Liu, Tong Zhang
Our proposal is computationally tractable and produces an estimator that achieves the oracle rate of convergence.
no code implementations • ICLR 2019 • Yandong Li, Lijun Li, Liqiang Wang, Tong Zhang, Boqing Gong
In other words, there is a population of adversarial examples, instead of only one, for any input to a DNN.
no code implementations • 29 Dec 2018 • Haishan Ye, Zhichao Huang, Cong Fang, Chris Junchi Li, Tong Zhang
Zeroth-order optimization is an important research topic in machine learning.