Search Results for author: Hao Zhang

Found 405 papers, 148 papers with code

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

7 code implementations • 9 Mar 2023 • Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Hao Zhang, Jie Yang, Chunyuan Li, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang

To effectively fuse language and vision modalities, we conceptually divide a closed-set detector into three phases and propose a tight fusion solution, which includes a feature enhancer, a language-guided query selection, and a cross-modality decoder for cross-modality fusion.

Ranked #1 on Zero-Shot Object Detection on MSCOCO

Referring Expression Referring Expression Comprehension +2

124,353

Paper
Code

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

1 code implementation • 21 Sep 2023 • Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

Studying how people interact with large language models (LLMs) in real-world scenarios is increasingly important due to their widespread use in various applications.

Chatbot Instruction Following

33,531

Paper
Code

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

5 code implementations • NeurIPS 2023 • Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric P. Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Evaluating large language model (LLM) based chat assistants is challenging due to their broad capabilities and the inadequacy of existing benchmarks in measuring human preferences.

Ranked #3 on Long-Context Understanding on Ada-LEval (TSort)

Chatbot Language Modelling +2

33,531

Paper
Code

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

1 code implementation • 7 Mar 2024 • Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Hao Zhang, Banghua Zhu, Michael Jordan, Joseph E. Gonzalez, Ion Stoica

To address this issue, we introduce Chatbot Arena, an open platform for evaluating LLMs based on human preferences.

Chatbot

33,531

Paper
Code

Efficient Memory Management for Large Language Model Serving with PagedAttention

4 code implementations • 12 Sep 2023 • Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, Ion Stoica

On top of it, we build vLLM, an LLM serving system that achieves (1) near-zero waste in KV cache memory and (2) flexible sharing of KV cache within and across requests to further reduce memory usage.

Language Modelling Large Language Model +1

17,720

Paper
Code

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation

6 code implementations • ICCV 2017 • Zili Yi, Hao Zhang, Ping Tan, Minglun Gong

Depending on the task complexity, thousands to millions of labeled image pairs are needed to train a conditional GAN.

Ranked #2 on Image-to-Image Translation on Aerial-to-Map

Image-to-Image Translation Translation

15,656

Paper
Code

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

14 code implementations • 7 Mar 2022 • Hao Zhang, Feng Li, Shilong Liu, Lei Zhang, Hang Su, Jun Zhu, Lionel M. Ni, Heung-Yeung Shum

Compared to other models on the leaderboard, DINO significantly reduces its model size and pre-training data size while achieving better results.

Ranked #1 on Real-Time Object Detection on COCO 2017 val

Real-Time Object Detection

13,342

Paper
Code

Segment Everything Everywhere All at Once

2 code implementations • NeurIPS 2023 • Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, JianFeng Wang, Lijuan Wang, Jianfeng Gao, Yong Jae Lee

In SEEM, we propose a novel decoding mechanism that enables diverse prompting for all types of segmentation tasks, aiming at a universal segmentation interface that behaves like large language models (LLMs).

Image Segmentation Interactive Segmentation +4

13,342

Paper
Code

Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks

1 code implementation • 25 Jan 2024 • Tianhe Ren, Shilong Liu, Ailing Zeng, Jing Lin, Kunchang Li, He Cao, Jiayu Chen, Xinyu Huang, Yukang Chen, Feng Yan, Zhaoyang Zeng, Hao Zhang, Feng Li, Jie Yang, Hongyang Li, Qing Jiang, Lei Zhang

We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to combine with the segment anything model (SAM).

Segmentation

13,342

Paper
Code

Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation

9 code implementations • CVPR 2023 • Feng Li, Hao Zhang, Huaizhe xu, Shilong Liu, Lei Zhang, Lionel M. Ni, Heung-Yeung Shum

In this paper we present Mask DINO, a unified object detection and segmentation framework.

Ranked #1 on Panoptic Segmentation on COCO test-dev

Image Segmentation Instance Segmentation +3

12,012

Paper
Code

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

3 code implementations • 17 Oct 2023 • Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao

We present Set-of-Mark (SoM), a new visual prompting method, to unleash the visual grounding abilities of large multimodal models (LMMs), such as GPT-4V.

Interactive Segmentation Referring Expression +4

4,002

Paper
Code

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning

1 code implementation • 28 Jan 2022 • Lianmin Zheng, Zhuohan Li, Hao Zhang, Yonghao Zhuang, Zhifeng Chen, Yanping Huang, Yida Wang, Yuanzhong Xu, Danyang Zhuo, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica

Existing model-parallel training systems either require users to manually create a parallelization plan or automatically generate one from a limited space of model parallelism configurations.

2,976

Paper
Code

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

2 code implementations • 22 Feb 2023 • Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Model parallelism is conventionally viewed as a method to scale a single large deep learning model beyond the memory limits of a single device.

2,976

Paper
Code

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

16 code implementations • CVPR 2022 • Feng Li, Hao Zhang, Shilong Liu, Jian Guo, Lionel M. Ni, Lei Zhang

Our method is universal and can be easily plugged into any DETR-like methods by adding dozens of lines of code to achieve a remarkable improvement.

Object Detection

1,964

Paper
Code

Semantic-SAM: Segment and Recognize Anything at Any Granularity

1 code implementation • 10 Jul 2023 • Feng Li, Hao Zhang, Peize Sun, Xueyan Zou, Shilong Liu, Jianwei Yang, Chunyuan Li, Lei Zhang, Jianfeng Gao

In this paper, we introduce Semantic-SAM, a universal image segmentation model to enable segment and recognize anything at any desired granularity.

Image Segmentation Segmentation +1

1,892

Paper
Code

Visual In-Context Prompting

3 code implementations • 22 Nov 2023 • Feng Li, Qing Jiang, Hao Zhang, Tianhe Ren, Shilong Liu, Xueyan Zou, Huaizhe xu, Hongyang Li, Chunyuan Li, Jianwei Yang, Lei Zhang, Jianfeng Gao

In-context prompting in large language models (LLMs) has become a prevalent approach to improve zero-shot capabilities, but this idea is less explored in the vision domain.

Segmentation Visual Prompting

1,892

Paper
Code

DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

7 code implementations • ICLR 2022 • Shilong Liu, Feng Li, Hao Zhang, Xiao Yang, Xianbiao Qi, Hang Su, Jun Zhu, Lei Zhang

We present in this paper a novel query formulation using dynamic anchor boxes for DETR (DEtection TRansformer) and offer a deeper understanding of the role of queries in DETR.

Ranked #11 on 2D Object Detection on SARDet-100K

Object Detection

1,808

Paper
Code

detrex: Benchmarking Detection Transformers

1 code implementation • 12 Jun 2023 • Tianhe Ren, Shilong Liu, Feng Li, Hao Zhang, Ailing Zeng, Jie Yang, Xingyu Liao, Ding Jia, Hongyang Li, He Cao, Jianan Wang, Zhaoyang Zeng, Xianbiao Qi, Yuhui Yuan, Jianwei Yang, Lei Zhang

To address this issue, we develop a unified, highly modular, and lightweight codebase called detrex, which supports a majority of the mainstream DETR-based instance recognition algorithms, covering various fundamental tasks, including object detection, segmentation, and pose estimation.

Benchmarking object-detection +2

1,808

Paper
Code

A Simple Framework for Open-Vocabulary Segmentation and Detection

2 code implementations • ICCV 2023 • Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang

We present OpenSeeD, a simple Open-vocabulary Segmentation and Detection framework that jointly learns from different segmentation and detection datasets.

Ranked #2 on Instance Segmentation on ADE20K val (using extra training data)

Instance Segmentation Panoptic Segmentation +2

1,242

Paper
Code

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

1 code implementation • 3 Feb 2024 • Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang

Autoregressive decoding of large language models (LLMs) is memory bandwidth bounded, resulting in high latency and significant wastes of the parallel processing power of modern accelerators.

Code Completion

962

Paper
Code

How Can Recommender Systems Benefit from Large Language Models: A Survey

1 code implementation • 9 Jun 2023 • Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang

In this paper, we conduct a comprehensive survey on this research direction from the perspective of the whole pipeline in real-world recommender systems.

Ethics Feature Engineering +5

729

Paper
Code

A Strong and Reproducible Object Detector with Only Public Datasets

2 code implementations • 25 Apr 2023 • Tianhe Ren, Jianwei Yang, Shilong Liu, Ailing Zeng, Feng Li, Hao Zhang, Hongyang Li, Zhaoyang Zeng, Lei Zhang

This work presents Focal-Stable-DINO, a strong and reproducible object detection model which achieves 64. 6 AP on COCO val2017 and 64. 8 AP on COCO test-dev using only 700M parameters without any test time augmentation.

Ranked #5 on Object Detection on COCO minival (using extra training data)

object-detection Object Detection

646

Paper
Code

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

1 code implementation • 9 Nov 2023 • Shilong Liu, Hao Cheng, Haotian Liu, Hao Zhang, Feng Li, Tianhe Ren, Xueyan Zou, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang, Jianfeng Gao, Chunyuan Li

LLaVA-Plus is a general-purpose multimodal assistant that expands the capabilities of large multimodal models.

Ranked #1 on LMM real-life tasks on Leaderboard

Instruction Following LLM real-life tasks +3

612

Paper
Code

Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning

2 code implementations • 27 Aug 2020 • Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, Eric P. Xing

Some recent schedulers choose job resources for users, but do so without awareness of how DL training can be re-optimized to better utilize the provided resources.

Fairness Scheduling

400

Paper
Code

Learning Implicit Fields for Generative Shape Modeling

4 code implementations • CVPR 2019 • Zhiqin Chen, Hao Zhang

We advocate the use of implicit fields for learning generative models of shapes and introduce an implicit field decoder, called IM-NET, for shape generation, aimed at improving the visual quality of the generated shapes.

3D Reconstruction 3D Shape Representation +2

385

Paper
Code

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

2 code implementations • 28 Sep 2017 • Pinxin Long, Tingxiang Fan, Xinyi Liao, Wenxi Liu, Hao Zhang, Jia Pan

We validate the learned sensor-level collision avoidance policy in a variety of simulated scenarios with thorough performance evaluations and show that the final learned policy is able to find time efficient, collision-free paths for a large-scale robot system.

Collision Avoidance reinforcement-learning +1

299

Paper
Code

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

1 code implementation • 5 Dec 2023 • Hao Zhang, Hongyang Li, Feng Li, Tianhe Ren, Xueyan Zou, Shilong Liu, Shijia Huang, Jianfeng Gao, Lei Zhang, Chunyuan Li, Jianwei Yang

To address this issue, we have created GVC data that allows for the combination of grounding and chat capabilities.

229

Paper
Code

Neural Dual Contouring

2 code implementations • 4 Feb 2022 • Zhiqin Chen, Andrea Tagliasacchi, Thomas Funkhouser, Hao Zhang

We introduce neural dual contouring (NDC), a new data-driven approach to mesh reconstruction based on dual contouring (DC).

Surface Reconstruction

212

Paper
Code

BSP-Net: Generating Compact Meshes via Binary Space Partitioning

3 code implementations • CVPR 2020 • Zhiqin Chen, Andrea Tagliasacchi, Hao Zhang

The network is trained to reconstruct a shape using a set of convexes obtained from a BSP-tree built on a set of planes.

3D Reconstruction 3D Shape Representation

190

Paper
Code

Learning Mesh Representations via Binary Space Partitioning Tree Networks

1 code implementation • 27 Jun 2021 • Zhiqin Chen, Andrea Tagliasacchi, Hao Zhang

The network is trained to reconstruct a shape using a set of convexes obtained from a BSP-tree built over a set of planes, where the planes and convexes are both defined by learned network weights.

190

Paper
Code

Detection Transformer with Stable Matching

1 code implementation • ICCV 2023 • Shilong Liu, Tianhe Ren, Jiayu Chen, Zhaoyang Zeng, Hao Zhang, Feng Li, Hongyang Li, Jun Huang, Hang Su, Jun Zhu, Lei Zhang

We point out that the unstable matching in DETR is caused by a multi-optimization path problem, which is highlighted by the one-to-one matching design in DETR.

Position

176

Paper
Code

Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR

1 code implementation • 13 Mar 2023 • Feng Li, Ailing Zeng, Shilong Liu, Hao Zhang, Hongyang Li, Lei Zhang, Lionel M. Ni

Recent DEtection TRansformer-based (DETR) models have obtained remarkable performance.

object-detection Object Detection

174

Paper
Code

From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

1 code implementation • 13 Oct 2023 • Dongsheng Jiang, Yuchen Liu, Songlin Liu, Jin'e Zhao, Hao Zhang, Zhen Gao, Xiaopeng Zhang, Jin Li, Hongkai Xiong

By simply equipping it with an MLP layer for alignment, DINO surpasses CLIP in fine-grained related perception tasks.

Hallucination Image Captioning +3

173

Paper
Code

Neural Marching Cubes

1 code implementation • 21 Jun 2021 • Zhiqin Chen, Hao Zhang

To tackle these challenges, we re-cast MC from a deep learning perspective, by designing tessellation templates more apt at preserving geometric features, and learning the vertex positions and mesh topologies from training meshes, to account for contextual information from nearby cubes.

153

Paper
Code

DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training

1 code implementation • 5 Oct 2023 • Dacheng Li, Rulin Shao, Anze Xie, Eric P. Xing, Xuezhe Ma, Ion Stoica, Joseph E. Gonzalez, Hao Zhang

FlashAttention (Dao, 2023) effectively reduces the quadratic peak memory usage to linear in training transformer-based large language models (LLMs) on a single GPU.

145

Paper
Code

NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation

1 code implementation • ACL 2012 • Tong Xiao, Jingbo Zhu, Hao Zhang, Qiang Li

Language Modelling Machine Translation +1

139

Paper
Code

DS-Fusion: Artistic Typography via Discriminated and Stylized Diffusion

1 code implementation • ICCV 2023 • Maham Tanveer, Yizhi Wang, Ali Mahdavi-Amiri, Hao Zhang

We introduce a novel method to automatically generate an artistic typography by stylizing one or more letter fonts to visually convey the semantics of an input word, while ensuring that the output remains readable.

Denoising

136

Paper
Code

PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes

3 code implementations • CVPR 2020 • Rundi Wu, Yixin Zhuang, Kai Xu, Hao Zhang, Baoquan Chen

We introduce PQ-NET, a deep neural network which represents and generates 3D shapes via sequential part assembly.

3D Reconstruction Single-View 3D Reconstruction

111

Paper
Code

MP-Former: Mask-Piloted Transformer for Image Segmentation

1 code implementation • CVPR 2023 • Hao Zhang, Feng Li, Huaizhe xu, Shijia Huang, Shilong Liu, Lionel M. Ni, Lei Zhang

We present a mask-piloted Transformer which improves masked-attention in Mask2Former for image segmentation.

Image Segmentation Segmentation +1

105

Paper
Code

SketchyScene: Richly-Annotated Scene Sketches

2 code implementations • ECCV 2018 • Changqing Zou, Qian Yu, Ruofei Du, Haoran Mo, Yi-Zhe Song, Tao Xiang, Chengying Gao, Baoquan Chen, Hao Zhang

We contribute the first large-scale dataset of scene sketches, SketchyScene, with the goal of advancing research on sketch understanding at both the object and scene level.

Colorization Image Retrieval +2

101

Paper
Code

Span-based Localizing Network for Natural Language Video Localization

1 code implementation • ACL 2020 • Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

Given an untrimmed video and a text query, natural language video localization (NLVL) is to locate a matching span from the video that semantically corresponds to the query.

Paper
Code

Automatic Photo Adjustment Using Deep Neural Networks

1 code implementation • 24 Dec 2014 • Zhicheng Yan, Hao Zhang, Baoyuan Wang, Sylvain Paris, Yizhou Yu

Many photographic styles rely on subtle adjustments that depend on the image content and even its semantics.

Photo Retouching

Paper
Code

Interfacing Foundation Models' Embeddings

1 code implementation • 12 Dec 2023 • Xueyan Zou, Linjie Li, JianFeng Wang, Jianwei Yang, Mingyu Ding, Zhengyuan Yang, Feng Li, Hao Zhang, Shilong Liu, Arul Aravinthan, Yong Jae Lee, Lijuan Wang

The proposed interface is adaptive to new tasks, and new models.

Image Segmentation Retrieval +2

Paper
Code

DECOR-GAN: 3D Shape Detailization by Conditional Refinement

1 code implementation • CVPR 2021 • Zhiqin Chen, Vladimir G. Kim, Matthew Fisher, Noam Aigerman, Hao Zhang, Siddhartha Chaudhuri

During testing, a style code is fed into the generator to condition the refinement.

Generative Adversarial Network

Paper
Code

MPCFormer: fast, performant and private Transformer inference with MPC

1 code implementation • 2 Nov 2022 • Dacheng Li, Rulin Shao, Hongyi Wang, Han Guo, Eric P. Xing, Hao Zhang

Through extensive evaluations, we show that MPCFORMER significantly speeds up Transformer inference in MPC settings while achieving similar ML performance to the input model.

Knowledge Distillation

Paper
Code

UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

1 code implementation • 19 Aug 2022 • Rachneet Sachdeva, Haritz Puerto, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

In this paper, we introduce SQuARE v2, the new version of SQuARE, to provide an explainability infrastructure for comparing models based on methods such as saliency maps and graph-based explanations.

Adversarial Attack Explainable Models +2

Paper
Code

UKP-SQuARE v3: A Platform for Multi-Agent QA Research

1 code implementation • 31 Mar 2023 • Haritz Puerto, Tim Baumgärtner, Rachneet Sachdeva, Haishuo Fang, Hao Zhang, Sewin Tariverdian, Kexin Wang, Iryna Gurevych

To ease research in multi-agent models, we extend UKP-SQuARE, an online platform for QA research, to support three families of multi-agent systems: i) agent selection, ii) early-fusion of agents, and iii) late-fusion of agents.

Question Answering

Paper
Code

Token Shift Transformer for Video Classification

3 code implementations • 5 Aug 2021 • Hao Zhang, Yanbin Hao, Chong-Wah Ngo

It is worth noticing that our TokShift transformer is a pure convolutional-free video transformer pilot with computational efficiency for video understanding.

Classification Computational Efficiency +2

Paper
Code

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

1 code implementation • CVPR 2023 • Zequn Zeng, Hao Zhang, Zhengjue Wang, Ruiying Lu, Dongsheng Wang, Bo Chen

Zero-shot capability has been considered as a new revolution of deep learning, letting machines work on tasks without curated training data.

Image Captioning Language Modelling

Paper
Code

BAE-NET: Branched Autoencoder for Shape Co-Segmentation

1 code implementation • ICCV 2019 • Zhiqin Chen, Kangxue Yin, Matthew Fisher, Siddhartha Chaudhuri, Hao Zhang

The unsupervised BAE-NET is trained with a collection of un-segmented shapes, using a shape reconstruction loss, without any ground-truth labels.

One-Shot Learning Representation Learning

Paper
Code

TilinGNN: Learning to Tile with Self-Supervised Graph Neural Network

1 code implementation • 5 Jul 2020 • Hao Xu, Ka Hei Hui, Chi-Wing Fu, Hao Zhang

To start, we reformulate tiling as a graph problem by modeling candidate tile locations in the target shape as graph nodes and connectivity between tile locations as edges.

Paper
Code

TimeMAE: Self-Supervised Representations of Time Series with Decoupled Masked Autoencoders

1 code implementation • 1 Mar 2023 • Mingyue Cheng, Qi Liu, Zhiding Liu, Hao Zhang, Rujiao Zhang, Enhong Chen

In this work, we propose TimeMAE, a novel self-supervised paradigm for learning transferrable time series representations based on transformer networks.

Time Series Time Series Analysis +1

Paper
Code

TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models

1 code implementation • 16 Feb 2021 • Zhuohan Li, Siyuan Zhuang, Shiyuan Guo, Danyang Zhuo, Hao Zhang, Dawn Song, Ion Stoica

With this key idea, we design TeraPipe, a high-performance token-level pipeline parallel algorithm for synchronous model-parallel training of Transformer-based language models.

Paper
Code

FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models

2 code implementations • NeurIPS 2023 • Hao Zhang, Yanbo Xu, Tianyuan Dai, Yu-Wing Tai, Chi-Keung Tang

The ability to create high-quality 3D faces from a single image has become increasingly important with wide applications in video conferencing, AR/VR, and advanced video editing in movie industries.

3D Face Reconstruction Video Editing +1

Paper
Code

DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding

1 code implementation • 28 Nov 2022 • Shilong Liu, Yaoyuan Liang, Feng Li, Shijia Huang, Hao Zhang, Hang Su, Jun Zhu, Lei Zhang

As phrase extraction can be regarded as a $1$D text segmentation problem, we formulate PEG as a dual detection problem and propose a novel DQ-DETR model, which introduces dual queries to probe different features from image and text for object prediction and phrase mask prediction.

Ranked #7 on Referring Expression Comprehension on RefCOCO

object-detection Object Detection +4

Paper
Code

BalaGAN: Image Translation Between Imbalanced Domains via Cross-Modal Transfer

1 code implementation • 5 Oct 2020 • Or Patashnik, Dov Danon, Hao Zhang, Daniel Cohen-Or

State-of-the-art image-to-image translation methods tend to struggle in an imbalanced domain setting, where one image domain lacks richness and diversity.

Image-to-Image Translation Style Transfer +1

Paper
Code

Video Corpus Moment Retrieval with Contrastive Learning

1 code implementation • 13 May 2021 • Hao Zhang, Aixin Sun, Wei Jing, Guoshun Nan, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh

We adopt the first approach and introduce two contrastive learning objectives to refine video encoder and text encoder to learn video and text representations separately but with better alignment for VCMR.

Contrastive Learning Moment Retrieval +2

Paper
Code

WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling

1 code implementation • ICLR 2018 • Hao Zhang, Bo Chen, Dandan Guo, Mingyuan Zhou

To train an inference network jointly with a deep generative topic model, making it both scalable to big corpora and fast in out-of-sample prediction, we develop Weibull hybrid autoencoding inference (WHAI) for deep latent Dirichlet allocation, which infers posterior samples via a hybrid of stochastic-gradient MCMC and autoencoding variational Bayes.

Paper
Code

UKP-Athene: Multi-Sentence Textual Entailment for Claim Verification

1 code implementation • WS 2018 • Andreas Hanselowski, Hao Zhang, Zile Li, Daniil Sorokin, Benjamin Schiller, Claudia Schulz, Iryna Gurevych

The Fact Extraction and VERification (FEVER) shared task was launched to support the development of systems able to verify claims by extracting supporting or refuting facts from raw text.

Claim Verification Entity Linking +4

Paper
Code

CLLMs: Consistency Large Language Models

1 code implementation • 28 Feb 2024 • Siqi Kou, Lanxiang Hu, Zhezhi He, Zhijie Deng, Hao Zhang

Parallel decoding methods such as Jacobi decoding show promise for more efficient LLM inference as it breaks the sequential nature of the LLM decoding process and transforms it into parallelizable computation.

Paper
Code

LayoutGMN: Neural Graph Matching for Structural Layout Similarity

1 code implementation • CVPR 2021 • Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang

In particular, retrieval results by our network better match human judgement of structural layout similarity compared to both IoUs and other baselines including a state-of-the-art method based on graph neural networks and image convolution.

Graph Matching Metric Learning +1

Paper
Code

A Prototype-Oriented Framework for Unsupervised Domain Adaptation

1 code implementation • NeurIPS 2021 • Korawat Tanwisuth, Xinjie Fan, Huangjie Zheng, Shujian Zhang, Hao Zhang, Bo Chen, Mingyuan Zhou

Existing methods for unsupervised domain adaptation often rely on minimizing some statistical distance between the source and target samples in the latent space.

Unsupervised Domain Adaptation

Paper
Code

Roof-GAN: Learning to Generate Roof Geometry and Relations for Residential Houses

1 code implementation • CVPR 2021 • Yiming Qian, Hao Zhang, Yasutaka Furukawa

This paper presents Roof-GAN, a novel generative adversarial network that generates structured geometry of residential roof structures as a set of roof primitives and their relationships.

Generative Adversarial Network

Paper
Code

Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects

2 code implementations • 9 Aug 2013 • Binghang Liu, Yujian Shi, Jianying Yuan, Xuesong Hu, Hao Zhang, Nan Li, Zhenyu Li, Yanxiang Chen, Desheng Mu, Wei Fan

Therefore, it is necessary to develop efficient assembly-independent methods for accurate estimation of these genomic characteristics.

Paper
Code

GDPNet: Refining Latent Multi-View Graph for Relation Extraction

1 code implementation • 12 Dec 2020 • Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng

Recent advances on RE task are from BERT-based sequence modeling and graph-based modeling of relationships among the tokens in the sequence.

Ranked #4 on Dialog Relation Extraction on DialogRE (F1c (v1) metric)

Dialog Relation Extraction Dynamic Time Warping +2

Paper
Code

Discovering and Explaining the Representation Bottleneck of DNNs

1 code implementation • ICLR 2022 • Huiqi Deng, Qihan Ren, Hao Zhang, Quanshi Zhang

This paper explores the bottleneck of feature representations of deep neural networks (DNNs), from the perspective of the complexity of interactions between input variables encoded in DNNs.

Paper
Code

BIRNAT: Bidirectional Recurrent Neural Networks with Adversarial Training for Video Snapshot Compressive Imaging

1 code implementation • ECCV 2020 • Ziheng Cheng, Ruiying Lu, Zhengjue Wang, Hao Zhang, Bo Chen, Ziyi Meng, Xin Yuan

This measurement and the modulation masks are fed into our Recurrent Neural Network (RNN) to reconstruct the desired high-speed frames.

Paper
Code

Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale

1 code implementation • 29 Dec 2023 • Hao Zhang, Shuaijie Zhang

As an important component of the detector localization branch, bounding box regression loss plays a significant role in object detection tasks.

object-detection Object Detection +1

Paper
Code

Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning

1 code implementation • 8 Feb 2022 • Yang Zhao, Hao Zhang, Xiuyuan Hu

In this paper, we propose an effective method to improve the model generalization by additionally penalizing the gradient norm of loss function during optimization.

Paper
Code

Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding Box

1 code implementation • 6 Nov 2023 • Hao Zhang, Cong Xu, Shuaijie Zhang

Based on the above, we first analyzed the BBR model and concluded that distinguishing different regression samples and using different scales of auxiliary bounding boxes to calculate losses can effectively accelerate the bounding box regression process.

Ranked #1 on Object Detection on AI-TOD (mAP50 metric)

Object Detection regression

Paper
Code

HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual Recognition

4 code implementations • 3 Oct 2014 • Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis Decoste, Wei Di, Yizhou Yu

In this paper, we introduce hierarchical deep CNNs (HD-CNNs) by embedding deep CNNs into a category hierarchy.

Ranked #174 on Image Classification on CIFAR-100

Image Classification Object Recognition

Paper
Code

An End-to-End Neural Network for Image Cropping by Learning Composition from Aesthetic Photos

2 code implementations • 2 Jul 2019 • Peng Lu, Hao Zhang, Xujun Peng, Xiaofu Jin

In this paper, we primarily focus on improving the accuracy of automatic image cropping, and on further exploring its potential in public datasets with high efficiency.

Image Cropping

Paper
Code

AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness

1 code implementation • 13 Oct 2022 • Dacheng Li, Hongyi Wang, Eric Xing, Hao Zhang

Scaling up model sizes can lead to fundamentally new capabilities in many machine learning (ML) tasks.

valid

Paper
Code

CompoNet: Learning to Generate the Unseen by Part Synthesis and Composition

1 code implementation • ICCV 2019 • Nadav Schor, Oren Katzir, Hao Zhang, Daniel Cohen-Or

Data-driven generative modeling has made remarkable progress by leveraging the power of deep neural networks.

Paper
Code

RPM-Net: Recurrent Prediction of Motion and Parts from Point Cloud

1 code implementation • 26 Jun 2020 • Zihao Yan, Ruizhen Hu, Xingguang Yan, Luanmin Chen, Oliver van Kaick, Hao Zhang, Hui Huang

We show results of simultaneous motion and part predictions from synthetic and real scans of 3D objects exhibiting a variety of part mobilities, possibly involving multiple movable parts.

Semantic Segmentation

Paper
Code

Structured Generative Adversarial Networks

1 code implementation • NeurIPS 2017 • Zhijie Deng, Hao Zhang, Xiaodan Liang, Luona Yang, Shizhen Xu, Jun Zhu, Eric P. Xing

We study the problem of conditional generative modeling based on designated semantics or structures.

Semi-Supervised Image Classification Style Transfer

Paper
Code

Predictive and Generative Neural Networks for Object Functionality

1 code implementation • 28 Jun 2020 • Ruizhen Hu, Zihao Yan, Jingwen Zhang, Oliver van Kaick, Ariel Shamir, Hao Zhang, Hui Huang

Given a 3D object in isolation, our functional similarity network (fSIM-NET), a variation of the triplet network, is trained to predict the functionality of the object by inferring functionality-revealing interaction contexts.

Object

Paper
Code

Memory-Efficient Network for Large-scale Video Compressive Sensing

2 code implementations • CVPR 2021 • Ziheng Cheng, Bo Chen, Guanliang Liu, Hao Zhang, Ruiying Lu, Zhengjue Wang, Xin Yuan

With the knowledge of masks, optimization algorithms or deep learning methods are employed to reconstruct the desired high-speed video frames from this snapshot measurement.

Compressive Sensing Demosaicking +1

Paper
Code

Group Contextualization for Video Recognition

1 code implementation • CVPR 2022 • Yanbin Hao, Hao Zhang, Chong-Wah Ngo, Xiangnan He

By utilizing calibrators to embed feature with four different kinds of contexts in parallel, the learnt representation is expected to be more resilient to diverse types of activities.

Ranked #3 on Egocentric Activity Recognition on EGTEA

Action Recognition Egocentric Activity Recognition +1

Paper
Code

MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive Learning

1 code implementation • 7 Jun 2022 • Zhifeng Ma, Hao Zhang, Jie Liu

Spatiotemporal predictive learning, which predicts future frames through historical prior knowledge with the aid of deep learning, is widely used in many fields.

Video Prediction

Paper
Code

FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF

1 code implementation • 5 Jan 2024 • Hao Zhang, Yu-Wing Tai, Chi-Keung Tang

However, achieving simultaneously multi-view consistency and temporal coherence while editing video sequences remains a formidable challenge.

Video Editing

Paper
Code

High-accuracy mass, spin, and recoil predictions of generic black-hole merger remnants

1 code implementation • 24 Sep 2018 • Vijay Varma, Davide Gerosa, François Hébert, Leo C. Stein, Hao Zhang

We present accurate fits for the remnant properties of generically precessing binary black holes, trained on large banks of numerical-relativity simulations.

General Relativity and Quantum Cosmology High Energy Astrophysical Phenomena

Paper
Code

AutoLoss: Learning Discrete Schedules for Alternate Optimization

1 code implementation • 4 Oct 2018 • Haowen Xu, Hao Zhang, Zhiting Hu, Xiaodan Liang, Ruslan Salakhutdinov, Eric Xing

Many machine learning problems involve iteratively and alternately optimizing different task objectives with respect to different sets of parameters.

Image Generation Machine Translation +4

Paper
Code

Symbolic Graph Reasoning Meets Convolutions

1 code implementation • NeurIPS 2018 • Xiaodan Liang, Zhiting Hu, Hao Zhang, Liang Lin, Eric P. Xing

To cooperate with local convolutions, each SGR is constituted by three modules: a) a primal local-to-semantic voting module where the features of all symbolic nodes are generated by voting from local representations; b) a graph reasoning module propagates information over knowledge graph to achieve global semantic coherency; c) a dual semantic-to-local mapping module learns new associations of the evolved symbolic nodes with local representations, and accordingly enhances local features.

Ranked #81 on Semantic Segmentation on ADE20K val

Image Classification Semantic Segmentation

Paper
Code

Physical Interaction: Reconstructing Hand-object Interactions with Physics

1 code implementation • 22 Sep 2022 • Haoyu Hu, Xinyu Yi, Hao Zhang, Jun-Hai Yong, Feng Xu

Single view-based reconstruction of hand-object interaction is challenging due to the severe observation missing caused by occlusions.

Object

Paper
Code

MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing

2 code implementations • CVPR 2021 • Zhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan

To capture high-speed videos using a two-dimensional detector, video snapshot compressive imaging (SCI) is a promising system, where the video frames are coded by different masks and then compressed to a snapshot measurement.

Compressive Sensing Video Compressive Sensing

Paper
Code

Can learning from natural image denoising be used for seismic data interpolation?

1 code implementation • 27 Feb 2019 • Hao Zhang, Xiuyan Yang, Jianwei Ma

We propose a convolutional neural network (CNN) denoising based method for seismic data interpolation.

De-aliasing Image Denoising

Paper
Code

FLNeRF: 3D Facial Landmarks Estimation in Neural Radiance Fields

1 code implementation • 21 Nov 2022 • Hao Zhang, Tianyuan Dai, Yu-Wing Tai, Chi-Keung Tang

This paper presents the first significant work on directly predicting 3D face landmarks on neural radiance fields (NeRFs).

Paper
Code

De novo Drug Design using Reinforcement Learning with Multiple GPT Agents

1 code implementation • NeurIPS 2023 • Xiuyuan Hu, Guoqing Liu, Yang Zhao, Hao Zhang

A central challenge in this field is to generate molecules with specific properties while also producing a wide range of diverse candidates.

reinforcement-learning

Paper
Code

MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction

1 code implementation • 30 May 2023 • Jing Wang, Aixin Sun, Hao Zhang, XiaoLi Li

Given a query, the task of Natural Language Video Localization (NLVL) is to localize a temporal moment in an untrimmed video that semantically matches the query.

Paper
Code

ARO-Net: Learning Implicit Fields from Anchored Radial Observations

1 code implementation • CVPR 2023 • Yizhi Wang, Zeyu Huang, Ariel Shamir, Hui Huang, Hao Zhang, Ruizhen Hu

We introduce anchored radial observations (ARO), a novel shape encoding for learning implicit field representation of 3D shapes that is category-agnostic and generalizable amid significant shape variations.

Surface Reconstruction

Paper
Code

Semi-supervised URL Segmentation with Recurrent Neural NetworksPre-trained on Knowledge Graph Entities

1 code implementation • 5 Nov 2020 • Hao Zhang, Jae Ro, Richard Sproat

Breaking domain names such as openresearch into component words open and research is important for applications like Text-to-Speech synthesis and web search.

Chinese Word Segmentation Speech Synthesis +1

Paper
Code

Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities

1 code implementation • COLING 2020 • Hao Zhang, Jae Ro, Richard Sproat

Breaking domain names such as openresearch into component words open and research is important for applications like Text-to-Speech synthesis and web search.

Chinese Word Segmentation Speech Synthesis +1

Paper
Code

GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation

1 code implementation • ECCV 2020 • Wallace Lira, Johannes Merz, Daniel Ritchie, Daniel Cohen-Or, Hao Zhang

Instead of executing translation directly, we steer the translation by requiring the network to produce in-between images that resemble weighted hybrids between images from the input domains.

Translation Unsupervised Image-To-Image Translation

Paper
Code

Adaptive Split-Fusion Transformer

1 code implementation • 26 Apr 2022 • Zixuan Su, Hao Zhang, Jingjing Chen, Lei Pang, Chong-Wah Ngo, Yu-Gang Jiang

Neural networks for visual content understanding have recently evolved from convolutional ones (CNNs) to transformers.

Ranked #1 on Image Classification on CIFAR-10 Image Classification

Image Classification

Paper
Code

High-throughput, high-resolution registration-free generated adversarial network microscopy

1 code implementation • 7 Jan 2018 • Hao Zhang, Xinlin Xie, Chunyu Fang, Yicong Yang, Di Jin, Peng Fei

We combine generative adversarial network (GAN) with light microscopy to achieve deep learning super-resolution under a large field of view (FOV).

Generative Adversarial Network Image Registration +2

Paper
Code

Focaler-IoU: More Focused Intersection over Union Loss

1 code implementation • 19 Jan 2024 • Hao Zhang, Shuaijie Zhang

Existing researchs improve regression performance by utilizing the geometric relationship between bounding boxes, while ignoring the impact of difficult and easy sample distribution on bounding box regression.

Object object-detection +2

Paper
Code

Hybrid Neural Networks for On-device Directional Hearing

1 code implementation • AAAI 2022 • Anran Wang, Maruchi Kim, Hao Zhang, Shyamnath Gollakota

On-device directional hearing requires audio source separation from a given direction while achieving stringent human-imperceptible latency requirements.

Ranked #1 on Real-time Directional Hearing on VCTK

Causal Inference Real-time Directional Hearing

Paper
Code

Manifoldron: Direct Space Partition via Manifold Discovery

2 code implementations • 14 Jan 2022 • Dayang Wang, Feng-Lei Fan, Bo-Jian Hou, Hao Zhang, Zhen Jia, Boce Zhou, Rongjie Lai, Hengyong Yu, Fei Wang

A neural network with the widely-used ReLU activation has been shown to partition the sample space into many convex polytopes for prediction.

BIG-bench Machine Learning

Paper
Code

Neural Eigenfunctions Are Structured Representation Learners

1 code implementation • 23 Oct 2022 • Zhijie Deng, Jiaxin Shi, Hao Zhang, Peng Cui, Cewu Lu, Jun Zhu

Unlike prior spectral methods such as Laplacian Eigenmap that operate in a nonparametric manner, Neural Eigenmap leverages NeuralEF to parametrically model eigenfunctions using a neural network.

Contrastive Learning Data Augmentation +7

Paper
Code

BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis

2 code implementations • 22 Mar 2018 • Zili Yi, Zhiqin Chen, Hao Cai, Wendong Mao, Minglun Gong, Hao Zhang

The key feature of BSD-GAN is that it is trained in multiple branches, progressively covering both the breadth and depth of the network, as resolutions of the training images increase to reveal finer-scale features.

Generative Adversarial Network Image Generation +1

Paper
Code

Variational Hetero-Encoder Randomized GANs for Joint Image-Text Modeling

1 code implementation • ICLR 2020 • Hao Zhang, Bo Chen, Long Tian, Zhengjue Wang, Mingyuan Zhou

For bidirectional joint image-text modeling, we develop variational hetero-encoder (VHE) randomized generative adversarial network (GAN), a versatile deep generative model that integrates a probabilistic text decoder, probabilistic image encoder, and GAN into a coherent end-to-end multi-modality learning framework.

Generative Adversarial Network

Paper
Code

Spin-Orbit Protection of Induced Superconductivity in Majorana Nanowires

1 code implementation • 5 Jul 2018 • Jouri D. S. Bommer, Hao Zhang, Önder Gül, Bas Nijholt, Michael Wimmer, Filipp N. Rybakov, Julien Garaud, Donjan Rodic, Egor Babaev, Matthias Troyer, Diana Car, Sébastien R. Plissard, Erik P. A. M. Bakkers, Kenji Watanabe, Takashi Taniguchi, Leo P. Kouwenhoven

Spin-orbit interaction (SOI) plays a key role in creating Majorana zero modes in semiconductor nanowires proximity coupled to a superconductor.

Mesoscale and Nanoscale Physics

Paper
Code

Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage

1 code implementation • 22 Jun 2020 • Shijing Si, Rui Wang, Jedrek Wosik, Hao Zhang, David Dov, Guoyin Wang, Ricardo Henao, Lawrence Carin

Small and imbalanced datasets commonly seen in healthcare represent a challenge when training classifiers based on deep learning models.

Paper
Code

RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures

1 code implementation • CVPR 2022 • Chengjie Niu, Manyi Li, Kai Xu, Hao Zhang

Each level of the tree corresponds to an assembly of shape parts, represented as implicit functions, to reconstruct the input shape.

Paper
Code

Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

1 code implementation • 19 Oct 2022 • Hao Zhang

A goodness-of-fit metric for LMD similar to the coefficient of determination is defined and used to measure the linear dependency of a set of LMs.

Language Modelling

Paper
Code

DAE-Net: Deforming Auto-Encoder for fine-grained shape co-segmentation

1 code implementation • 22 Nov 2023 • Zhiqin Chen, Qimin Chen, Hang Zhou, Hao Zhang

We present an unsupervised 3D shape co-segmentation method which learns a set of deformable part templates from a shape collection.

Paper
Code

Multi-Task Dense Prediction via Mixture of Low-Rank Experts

1 code implementation • 26 Mar 2024 • YuQi Yang, Peng-Tao Jiang, Qibin Hou, Hao Zhang, Jinwei Chen, Bo Li

Furthermore, to control the parameters and computational cost brought by the increase in the number of experts, we take inspiration from LoRA and propose to leverage the low-rank format of a vanilla convolution in the expert network.

Paper
Code

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

1 code implementation • 12 Apr 2024 • Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang, Jonathan May, Luke Zettlemoyer, Omer Levy, Chunting Zhou

The quadratic complexity and weak length extrapolation of Transformers limits their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and state space models exist, they empirically underperform Transformers in pretraining efficiency and downstream task accuracy.

Paper
Code

Downstream Transformer Generation of Question-Answer Pairs with Preprocessing and Postprocessing Pipelines

1 code implementation • 15 May 2022 • Cheng Zhang, Hao Zhang, Jie Wang

We present a system called TP3 to perform a downstream task of transformers on generating question-answer pairs (QAPs) from a given article.

Paper
Code

Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion

1 code implementation • 25 Mar 2024 • Xunpeng Yi, Han Xu, Hao Zhang, Linfeng Tang, Jiayi Ma

Through the text semantic encoder and semantic interaction fusion decoder, Text-IF is accessible to the all-in-one infrared and visible image degradation-aware processing and the interactive flexible fusion outcomes.

Paper
Code

Interventional Video Grounding with Dual Contrastive Learning

1 code implementation • CVPR 2021 • Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu

2) Meanwhile, we introduce a dual contrastive learning approach (DCL) to better align the text and video by maximizing the mutual information (MI) between query and video clips, and the MI between start/end frames of a target moment and the others within a video to learn more informative visual representations.

Causal Inference Contrastive Learning +2

Paper
Code

COSY: COunterfactual SYntax for Cross-Lingual Understanding

1 code implementation • ACL 2021 • Sicheng Yu, Hao Zhang, Yulei Niu, Qianru Sun, Jing Jiang

Pre-trained multilingual language models, e. g., multilingual-BERT, are widely used in cross-lingual tasks, yielding the state-of-the-art performance.

counterfactual Natural Language Inference +3

Paper
Code

Multi-relation Message Passing for Multi-label Text Classification

1 code implementation • 10 Feb 2022 • Muberra Ozmen, Hao Zhang, Pengyun Wang, Mark Coates

These examples motivate the modelling of multiple types of bi-directional relationships between labels.

Multi-Label Classification Multi-Label Image Classification +4

Paper
Code

Parameterization of Cross-Token Relations with Relative Positional Encoding for Vision MLP

1 code implementation • 15 Jul 2022 • Zhicai Wang, Yanbin Hao, Xingyu Gao, Hao Zhang, Shuo Wang, Tingting Mu, Xiangnan He

They use token-mixing layers to capture cross-token interactions, as opposed to the multi-head self-attention mechanism used by Transformers.

Paper
Code

ShaDDR: Interactive Example-Based Geometry and Texture Generation via 3D Shape Detailization and Differentiable Rendering

1 code implementation • 8 Jun 2023 • Qimin Chen, Zhiqin Chen, Hang Zhou, Hao Zhang

Furthermore, we showcase the ability of our method to learn geometric details and textures from shapes reconstructed from real-world photos.

Texture Synthesis

Paper
Code

Distantly-Supervised Long-Tailed Relation Extraction Using Constraint Graphs

1 code implementation • 24 May 2021 • Tianming Liang, Yang Liu, Xiaoyan Liu, Hao Zhang, Gaurav Sharma, Maozu Guo

On top of that, we further propose a novel constraint graph-based relation extraction framework(CGRE) to handle the two challenges simultaneously.

Ranked #3 on Relationship Extraction (Distant Supervised) on New York Times Corpus

Denoising Relation +2

Paper
Code

Heterogeneous Autoencoder Empowered by Quadratic Neurons

1 code implementation • 2 Apr 2022 • Jing-Xiao Liao, Bo-Jian Hou, Hang-Cheng Dong, Hao Zhang, Jianwei Ma, Jinwei Sun, Shiping Zhang, Feng-Lei Fan

Inspired by the complexity and diversity of biological neurons, a quadratic neuron is proposed to replace the inner product in the current neuron with a simplified quadratic function.

Anomaly Detection

Paper
Code

Incorporating Instructional Prompts into a Unified Generative Framework for Joint Multiple Intent Detection and Slot Filling

1 code implementation • COLING 2022 • Yangjun Wu, Han Wang, Dongxiang Zhang, Gang Chen, Hao Zhang

Specifically, we design 5-type templates as instructional prompts, and each template includes a question that acts as the driver to teach UGEN to grasp the paradigm, options that list the candidate intents or slots to reduce the answer search space, and the context denotes original utterance.

Intent Detection Question Answering +3

Paper
Code

MeaCap: Memory-Augmented Zero-shot Image Captioning

1 code implementation • 6 Mar 2024 • Zequn Zeng, Yan Xie, Hao Zhang, Chiyu Chen, Zhengjue Wang, Bo Chen

The framework of MeaCap achieves the state-of-the-art performance on a series of zero-shot IC settings.

Caption Generation Image Captioning +4

Paper
Code

FAME: 3D Shape Generation via Functionality-Aware Model Evolution

1 code implementation • 9 May 2020 • Yanran Guan, Han Liu, Kun Liu, Kangxue Yin, Ruizhen Hu, Oliver van Kaick, Yan Zhang, Ersin Yumer, Nathan Carr, Radomir Mech, Hao Zhang

Our tool supports constrained modeling, allowing users to restrict or steer the model evolution with functionality labels.

Graphics

Paper
Code

EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering

1 code implementation • ACL 2021 • Zhibin Duan, Hao Zhang, Chaojie Wang, Zhengjue Wang, Bo Chen, Mingyuan Zhou

As a result, the backbone learns the shared knowledge among all clusters while modulated weights extract the cluster-specific features.

Clustering Language Modelling

Paper
Code

SAC-GAN: Structure-Aware Image Composition

1 code implementation • 13 Dec 2021 • Hang Zhou, Rui Ma, Ling-Xiao Zhang, Lin Gao, Ali Mahdavi-Amiri, Hao Zhang

Specifically, our network takes the semantic layout features from the input scene image, features encoded from the edges and silhouette in the input object patch, as well as a latent code as inputs, and generates a 2D spatial affine transform defining the translation and scaling of the object patch.

Image Augmentation Object

Paper
Code

A Variational Edge Partition Model for Supervised Graph Representation Learning

1 code implementation • 7 Feb 2022 • Yilin He, Chaojie Wang, Hao Zhang, Bo Chen, Mingyuan Zhou

This paper introduces a graph generative process to model how the observed edges are generated by aggregating the node interactions over a set of overlapping node communities, each of which contributes to the edges via a logical OR mechanism.

Classification Graph Representation Learning +1

Paper
Code

Long-term Leap Attention, Short-term Periodic Shift for Video Classification

1 code implementation • 12 Jul 2022 • Hao Zhang, Lechao Cheng, Yanbin Hao, Chong-Wah Ngo

By replacing a vanilla 2D attention with the LAPS, we could adapt a static transformer into a video one, with zero extra parameters and neglectable computation overhead ($\sim$2. 6\%).

Video Classification

Paper
Code

Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning

1 code implementation • 27 Sep 2022 • Hao Zhang, Hao Wang, Zhen Kan

Automaton based approaches have enabled robots to perform various complex tasks.

Motion Planning reinforcement-learning +1

Paper
Code

NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing

1 code implementation • 18 May 2023 • Tingting Wu, Xiao Ding, Minji Tang, Hao Zhang, Bing Qin, Ting Liu

To mitigate the effects of label noise, learning with noisy labels (LNL) methods are designed to achieve better generalization performance.

Learning with noisy labels

Paper
Code

TLM: Token-Level Masking for Transformers

1 code implementation • 28 Oct 2023 • Yangjun Wu, Kebin Fang, Dongxiang Zhang, Han Wang, Hao Zhang, Gang Chen

Structured dropout approaches, such as attention dropout and DropHead, have been investigated to regularize the multi-head attention mechanism in Transformers.

Data-to-Text Generation Grammatical Error Correction +1

Paper
Code

Revisiting Single Image Reflection Removal In the Wild

1 code implementation • 29 Nov 2023 • Yurui Zhu, Xueyang Fu, Peng-Tao Jiang, Hao Zhang, Qibin Sun, Jinwei Chen, Zheng-Jun Zha, Bo Li

This research focuses on the issue of single-image reflection removal (SIRR) in real-world conditions, examining it from two angles: the collection pipeline of real reflection pairs and the perception of real reflection locations.

Reflection Removal

Paper
Code

Wavelet Regularization Benefits Adversarial Training

1 code implementation • 8 Jun 2022 • Jun Yan, Huilin Yin, Xiaoyang Deng, Ziming Zhao, Wancheng Ge, Hao Zhang, Gerhard Rigoll

Since adversarial vulnerability can be regarded as a high-frequency phenomenon, it is essential to regulate the adversarially-trained neural network models in the frequency domain.

Adversarial Robustness

Paper
Code

Computron: Serving Distributed Deep Learning Models with Model Parallel Swapping

1 code implementation • 24 Jun 2023 • Daniel Zou, Xinchen Jin, Xueyang Yu, Hao Zhang, James Demmel

In anticipation of workloads that involve serving many of such large models to handle different tasks, we develop Computron, a system that uses memory swapping to serve multiple distributed models on a shared GPU cluster.

Paper
Code

Parameter-Efficient Conversational Recommender System as a Language Processing Task

1 code implementation • 25 Jan 2024 • Mathieu Ravaut, Hao Zhang, Lu Xu, Aixin Sun, Yong liu

Conversational recommender systems (CRS) aim to recommend relevant items to users by eliciting user preference through natural language conversation.

Dialogue Generation Knowledge Graphs +2

Paper
Code

Interpretable Complex-Valued Neural Networks for Privacy Protection

1 code implementation • ICLR 2020 • Liyao Xiang, Haotian Ma, Hao Zhang, Yifan Zhang, Jie Ren, Quanshi Zhang

Previous studies have found that an adversary attacker can often infer unintended input information from intermediate-layer features.

Paper
Code

Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding

1 code implementation • 10 Jun 2019 • Haotian Ma, Hao Zhang, Fan Zhou, Yinqing Zhang, Quanshi Zhang

We define two types of entropy-based metrics, i. e. (1) the discarding of pixel-wise information used in the forward propagation, and (2) the uncertainty of the input reconstruction, to measure input information contained by a specific layer from two perspectives.

Fairness

Paper
Code

Deep N-ary Error Correcting Output Codes

1 code implementation • 22 Sep 2020 • Hao Zhang, Joey Tianyi Zhou, Tianying Wang, Ivor W. Tsang, Rick Siow Mong Goh

To facilitate the training of N-ary ECOC with deep learning base learners, we further propose three different variants of parameter sharing architectures for deep N-ary ECOC.

Ensemble Learning General Classification +3

Paper
Code

An Embarrassingly Simple Model for Dialogue Relation Extraction

1 code implementation • 27 Dec 2020 • Fuzhao Xue, Aixin Sun, Hao Zhang, Jinjie Ni, Eng Siong Chng

Dialogue relation extraction (RE) is to predict the relation type of two entities mentioned in a dialogue.

Ranked #9 on Dialog Relation Extraction on DialogRE

Dialog Relation Extraction Relation +1

Paper
Code

Unlocking the Potential of Large Language Models for Explainable Recommendations

1 code implementation • 25 Dec 2023 • Yucong Luo, Mingyue Cheng, Hao Zhang, Junyu Lu, Qi Liu, Enhong Chen

In this study, we propose LLMXRec, a simple yet effective two-stage explainable recommendation framework aimed at further boosting the explanation quality by employing LLMs.

Decision Making Explainable Recommendation +2

Paper
Code

Contrastive Attraction and Contrastive Repulsion for Representation Learning

1 code implementation • 8 May 2021 • Huangjie Zheng, Xu Chen, Jiangchao Yao, Hongxia Yang, Chunyuan Li, Ya zhang, Hao Zhang, Ivor Tsang, Jingren Zhou, Mingyuan Zhou

We realize this strategy with contrastive attraction and contrastive repulsion (CACR), which makes the query not only exert a greater force to attract more distant positive samples but also do so to repel closer negative samples.

Contrastive Learning Representation Learning

Paper
Code

Combined Invariant Subspace \& Frequency-Domain Subspace Method for Identification of Discrete-Time MIMO Linear Systems

1 code implementation • 12 Dec 2023 • Jingze You, Chao Huang, Hao Zhang

Recently, a novel system identification method based on invariant subspace theory is introduced, aiming to address the identification problem of continuous-time (CT) linear time-invariant (LTI) systems by combining time-domain and frequency-domain methods.

Paper
Code

Empirical Evidence for the Fragment level Understanding on Drug Molecular Structure of LLMs

1 code implementation • 15 Jan 2024 • Xiuyuan Hu, Guoqing Liu, Yang Zhao, Hao Zhang

AI for drug discovery has been a research hotspot in recent years, and SMILES-based language models has been increasingly applied in drug molecular design.

Drug Discovery

Paper
Code

Alternating Synthetic and Real Gradients for Neural Language Modeling

1 code implementation • 27 Feb 2019 • Fangxin Shang, Hao Zhang

Empirically, we demonstrate the effectiveness of alternating training with synthetic and real gradients after periodic warm restarts on language modeling tasks.

Language Modelling

Paper
Code

Sentence Bag Graph Formulation for Biomedical Distant Supervision Relation Extraction

1 code implementation • 29 Oct 2023 • Hao Zhang, Yang Liu, Xiaoyan Liu, Tianming Liang, Gaurav Sharma, Liang Xue, Maozu Guo

We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data.

Relation Relation Extraction +1

Paper
Code

P2P-NET: Bidirectional Point Displacement Net for Shape Transform

no code implementations • 25 Mar 2018 • Kangxue Yin, Hui Huang, Daniel Cohen-Or, Hao Zhang

We introduce P2P-NET, a general-purpose deep neural network which learns geometric transformations between point-based shape representations from two domains, e. g., meso-skeletons and surfaces, partial and complete scans, etc.

Paper
Add Code

Semi-Supervised Co-Analysis of 3D Shape Styles from Projected Lines

no code implementations • 18 Apr 2018 • Fenggen Yu, Yan Zhang, Kai Xu, Ali Mahdavi-Amiri, Hao Zhang

We present a semi-supervised co-analysis method for learning 3D shape styles from projected feature lines, achieving style patch localization with only weak supervision.

Clustering

Paper
Add Code

On the Selection of Anchors and Targets for Video Hyperlinking

no code implementations • 14 Apr 2018 • Zhi-Qi Cheng, Hao Zhang, Xiao Wu, Chong-Wah Ngo

A principle way of hyperlinking can be carried out by picking centers of clusters as anchors and from there reach out to targets within or outside of clusters with consideration of neighborhood complexity.

Paper
Add Code

Cavs: A Vertex-centric Programming Interface for Dynamic Neural Networks

no code implementations • 11 Dec 2017 • Hao Zhang, Shizhen Xu, Graham Neubig, Wei Dai, Qirong Ho, Guangwen Yang, Eric P. Xing

Recent deep learning (DL) models have moved beyond static network architectures to dynamic ones, handling data where the network structure changes every example, such as sequences of variable lengths, trees, and graphs.

graph construction Management +1

Paper
Add Code

Efficient and Effective Single-Document Summarizations and A Word-Embedding Measurement of Quality

no code implementations • 1 Oct 2017 • Liqun Shao, Hao Zhang, Ming Jia, Jie Wang

We show that the orderings of the ROUGE and WESM scores of our algorithms are highly comparable, suggesting that WESM may serve as a viable alternative for measuring the quality of a summary.

Clustering Keyword Extraction

Paper
Add Code

Mining Deep And-Or Object Structures via Cost-Sensitive Question-Answer-Based Active Annotations

no code implementations • 13 Aug 2017 • Quanshi Zhang, Ying Nian Wu, Hao Zhang, Song-Chun Zhu

The loss is defined for nodes in all layers of the AOG, including the generative loss (measuring the likelihood of the images) and the discriminative loss (measuring the fitness to human answers).

Question Answering

Paper
Add Code

Generative Semantic Manipulation with Contrasting GAN

no code implementations • 1 Aug 2017 • Xiaodan Liang, Hao Zhang, Eric P. Xing

Generative Adversarial Networks (GANs) have recently achieved significant improvement on paired/unpaired image-to-image translation, such as photo$\rightarrow$ sketch and artist painting style transfer.

Ranked #4 on Facial Expression Translation on CelebA

Image-to-Image Translation Style Transfer

Paper
Add Code

Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters

no code implementations • 11 Jun 2017 • Hao Zhang, Zeyu Zheng, Shizhen Xu, Wei Dai, Qirong Ho, Xiaodan Liang, Zhiting Hu, Jinliang Wei, Pengtao Xie, Eric P. Xing

We show that Poseidon enables Caffe and TensorFlow to achieve 15. 5x speed-up on 16 single-GPU machines, even with limited bandwidth (10GbE) and the challenging VGG19-22K network for image classification.

Image Classification

Paper
Add Code

GRASS: Generative Recursive Autoencoders for Shape Structures

no code implementations • 5 May 2017 • Jun Li, Kai Xu, Siddhartha Chaudhuri, Ersin Yumer, Hao Zhang, Leonidas Guibas

We introduce a novel neural network architecture for encoding and synthesis of 3D shapes, particularly their structures.

Paper
Add Code

SCAN: Structure Correcting Adversarial Network for Organ Segmentation in Chest X-rays

no code implementations • 26 Mar 2017 • Wei Dai, Joseph Doyle, Xiaodan Liang, Hao Zhang, Nanqing Dong, Yuan Li, Eric P. Xing

Through this adversarial process the critic network learns the higher order structures and guides the segmentation model to achieve realistic segmentation outcomes.

Organ Segmentation Segmentation

Paper
Add Code

Recurrent Topic-Transition GAN for Visual Paragraph Generation

no code implementations • ICCV 2017 • Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing

The proposed Recurrent Topic-Transition Generative Adversarial Network (RTT-GAN) builds an adversarial framework between a structured paragraph generator and multi-level paragraph discriminators.

Ranked #6 on Image Paragraph Captioning on Image Paragraph Captioning

Generative Adversarial Network Image Paragraph Captioning +1

Paper
Add Code

ZM-Net: Real-time Zero-shot Image Manipulation Network

no code implementations • 21 Mar 2017 • Hao Wang, Xiaodan Liang, Hao Zhang, Dit-yan Yeung, Eric P. Xing

We cast this problem as manipulating an input image according to a parametric model whose key parameters can be conditionally generated from any guiding signal (even unseen ones).

Colorization Descriptive +2

Paper
Add Code

Sequence-based Multimodal Apprenticeship Learning For Robot Perception and Decision Making

no code implementations • 24 Feb 2017 • Fei Han, Xue Yang, Yu Zhang, Hao Zhang

Apprenticeship learning has recently attracted a wide attention due to its capability of allowing robots to learn physical tasks directly from demonstrations provided by human experts.

Decision Making

Paper
Add Code

Simultaneous Feature and Body-Part Learning for Real-Time Robot Awareness of Human Behaviors

no code implementations • 24 Feb 2017 • Fei Han, Xue Yang, Christopher Reardon, Yu Zhang, Hao Zhang

We formulate FABL as a regression-like optimization problem with structured sparsity-inducing norms to model interrelationships of body parts and features.

Paper
Add Code

Space-Time Representation of People Based on 3D Skeletal Data: A Review

1 code implementation • 5 Jan 2016 • Fei Han, Brian Reily, William Hoff, Hao Zhang

Spatiotemporal human representation based on 3D visual perception data is a rapidly growing research area.

Feature Engineering

Paper
Code

Learning Concept Taxonomies from Multi-modal Data

no code implementations • ACL 2016 • Hao Zhang, Zhiting Hu, Yuntian Deng, Mrinmaya Sachan, Zhicheng Yan, Eric P. Xing

We study the problem of automatically building hypernym taxonomies from textual and visual data.

Feature Engineering

Paper
Add Code

Self-Reflective Risk-Aware Artificial Cognitive Modeling for Robot Response to Human Behaviors

no code implementations • 16 May 2016 • Fei Han, Christopher Reardon, Lynne E. Parker, Hao Zhang

In order for cooperative robots ("co-robots") to respond to human behaviors accurately and efficiently in human-robot collaboration, interpretation of human actions, awareness of new situations, and appropriate decision making are all crucial abilities for co-robots.

Decision Making

Paper
Add Code

Enforcing Template Representability and Temporal Consistency for Adaptive Sparse Tracking

no code implementations • 30 Apr 2016 • Xue Yang, Fei Han, Hua Wang, Hao Zhang

Sparse representation has been widely studied in visual tracking, which has shown promising tracking performance.

Descriptive Visual Tracking

Paper
Add Code

Combining the Best of Convolutional Layers and Recurrent Layers: A Hybrid Network for Semantic Segmentation

no code implementations • 15 Mar 2016 • Zhicheng Yan, Hao Zhang, Yangqing Jia, Thomas Breuel, Yizhou Yu

State-of-the-art results of semantic segmentation are established by Fully Convolutional neural Networks (FCNs).

Semantic Segmentation

Paper
Add Code

On the Reducibility of Submodular Functions

no code implementations • 4 Jan 2016 • Jincheng Mei, Hao Zhang, Bao-liang Lu

The scalability of submodular optimization methods is critical for their usability in practice.

Paper
Add Code

Poseidon: A System Architecture for Efficient GPU-based Deep Learning on Multiple Machines

no code implementations • 19 Dec 2015 • Hao Zhang, Zhiting Hu, Jinliang Wei, Pengtao Xie, Gunhee Kim, Qirong Ho, Eric Xing

To investigate how to adapt existing frameworks to efficiently support distributed GPUs, we propose Poseidon, a scalable system architecture for distributed inter-machine communication in existing DL frameworks.

Object Recognition

Paper
Add Code

Online Markov decision processes with policy iteration

no code implementations • 15 Oct 2015 • Yao Ma, Hao Zhang, Masashi Sugiyama

The online Markov decision process (MDP) is a generalization of the classical Markov decision process that incorporates changing reward functions.

Paper
Add Code

Task Selection for Bandit-Based Task Assignment in Heterogeneous Crowdsourcing

no code implementations • 26 Jul 2015 • Hao Zhang, Masashi Sugiyama

Task selection (picking an appropriate labeling task) and worker selection (assigning the labeling task to a suitable worker) are two major challenges in task assignment for crowdsourcing.

Active Learning

Paper
Add Code

Bandit-Based Task Assignment for Heterogeneous Crowdsourcing

no code implementations • 21 Jul 2015 • Hao Zhang, Yao Ma, Masashi Sugiyama

We consider a task assignment problem in crowdsourcing, which is aimed at collecting as many reliable labels as possible within a limited budget.

Paper
Add Code

Statistical models and regularization strategies in statistical image reconstruction of low-dose X-ray CT: a survey

no code implementations • 4 Dec 2014 • Hao Zhang, Jing Wang, Jianhua Ma, Hongbing Lu, Zhengrong Liang

Statistical image reconstruction (SIR) methods have shown potential to substantially improve the image quality of low-dose X-ray computed tomography (CT) as compared to the conventional filtered back-projection (FBP) method for various clinical tasks.

Computed Tomography (CT) Image Reconstruction

Paper
Add Code

Spatial-Spectral Boosting Analysis for Stroke Patients' Motor Imagery EEG in Rehabilitation Training

no code implementations • 23 Oct 2013 • Hao Zhang, Liqing Zhang

Current studies about motor imagery based rehabilitation training systems for stroke subjects lack an appropriate analytic method, which can achieve a considerable classification accuracy, at the same time detects gradual changes of imagery patterns during rehabilitation process and disinters potential mechanisms about motor function recovery.

EEG Motor Imagery

Paper
Add Code

Dual-label Deep LSTM Dereverberation For Speaker Verification

no code implementations • 8 Sep 2018 • Hao Zhang, Stephen Zahorian, Xiao Chen, Peter Guzewich, Xiaoyu Liu

In this paper, we present a reverberation removal approach for speaker verification, utilizing dual-label deep neural networks (DNNs).

Speaker Verification

Paper
Add Code

Semantic WordRank: Generating Finer Single-Document Summarizations

no code implementations • 12 Sep 2018 • Hao Zhang, Jie Wang

We present Semantic WordRank (SWR), an unsupervised method for generating an extractive summary of a single document.

Clustering

Paper
Add Code

SCORES: Shape Composition with Recursive Substructure Priors

no code implementations • 14 Sep 2018 • Chenyang Zhu, Kai Xu, Siddhartha Chaudhuri, Renjiao Yi, Hao Zhang

The network may significantly alter the geometry and structure of the input parts and synthesize a novel shape structure based on the inputs, while adding or removing parts to minimize a structure plausibility loss.

Paper
Add Code

Toward Understanding the Impact of Staleness in Distributed Machine Learning

no code implementations • ICLR 2019 • Wei Dai, Yi Zhou, Nanqing Dong, Hao Zhang, Eric P. Xing

Many distributed machine learning (ML) systems adopt the non-synchronous execution in order to alleviate the network communication bottleneck, resulting in stale parameters that do not reflect the latest updates.

BIG-bench Machine Learning

Paper
Add Code

Event Representation through Semantic Roles: Evaluation of Coverage

no code implementations • 9 Oct 2018 • Aliaksandr Huminski, Hao Zhang

Semantic role theory is a widely used approach for event representation.

Paper
Add Code

Towards Verifying Semantic Roles Co-occurrence

no code implementations • 9 Oct 2018 • Aliaksandr Huminski, Hao Zhang, Gangeshwar Krishnamurthy

Semantic role theory considers roles as a small universal set of unanalyzed entities.

Paper
Add Code

Hartley Spectral Pooling for Deep Learning

no code implementations • 7 Oct 2018 • Hao Zhang, Jianwei Ma

In most convolution neural networks (CNNs), downsampling hidden layers is adopted for increasing computation efficiency and the receptive field size.

Dimensionality Reduction

Paper
Add Code

Deep Poisson gamma dynamical systems

no code implementations • NeurIPS 2018 • Dandan Guo, Bo Chen, Hao Zhang, Mingyuan Zhou

We develop deep Poisson-gamma dynamical systems (DPGDS) to model sequentially observed multivariate count data, improving previously proposed models by not only mining deep hierarchical latent structure from the data, but also capturing both first-order and long-range temporal dependencies.

Data Augmentation Time Series +1

Paper
Add Code

Nearly-tight bounds on linear regions of piecewise linear neural networks

no code implementations • 31 Oct 2018 • Qiang Hu, Hao Zhang

The developments of deep neural networks (DNN) in recent years have ushered a brand new era of artificial intelligence.

Paper
Add Code

Fast and Accurate Reordering with ITG Transition RNN

no code implementations • COLING 2018 • Hao Zhang, Axel Ng, Richard Sproat

Compared to a strong baseline of attention-based RNN, our ITG RNN re-ordering model can reach the same reordering accuracy with only 1/10 of the training data and is 2. 5x faster in decoding.

Feature Engineering Machine Translation +3

Paper
Add Code

Learning Multi-Instance Enriched Image Representations via Non-Greedy Ratio Maximization of the l1-Norm Distances

no code implementations • CVPR 2018 • Kai Liu, Hua Wang, Feiping Nie, Hao Zhang

To tackle these two challenges, in this paper we propose a novel image representation learning method that can integrate the local patches (the instances) of an input image (the bag) and its holistic representation into one single-vector representation.

Representation Learning

Paper
Add Code

Generative Semantic Manipulation with Mask-Contrasting GAN

no code implementations • ECCV 2018 • Xiaodan Liang, Hao Zhang, Liang Lin, Eric Xing

Despite the promising results on paired/unpaired image-to-image translation achieved by Generative Adversarial Networks (GANs), prior works often only transfer the low-level information (e. g. color or texture changes), but fail to manipulate high-level semantic meanings (e. g., geometric structure or content) of different object regions.

Image-to-Image Translation

Paper
Add Code

DATNet: Dual Adversarial Transfer for Low-resource Named Entity Recognition

no code implementations • ICLR 2019 • Joey Tianyi Zhou, Hao Zhang, Di Jin, Hongyuan Zhu, Rick Siow Mong Goh, Kenneth Kwok

We propose a new architecture termed Dual Adversarial Transfer Network (DATNet) for addressing low-resource Named Entity Recognition (NER).

Low Resource Named Entity Recognition named-entity-recognition +2

Paper
Add Code

VHEGAN: Variational Hetero-Encoder Randomized GAN for Zero-Shot Learning

no code implementations • ICLR 2019 • Hao Zhang, Bo Chen, Long Tian, Zhengjue Wang, Mingyuan Zhou

To extract and relate visual and linguistic concepts from images and textual descriptions for text-based zero-shot learning (ZSL), we develop variational hetero-encoder (VHE) that decodes text via a deep probabilisitic topic model, the variational posterior of whose local latent variables is encoded from an image via a Weibull distribution based inference network.

Image Generation Retrieval +3

Paper
Add Code

Simplex-Based 3D Spatio-Temporal Feature Description for Action Recognition

no code implementations • CVPR 2014 • Hao Zhang, Wenjun Zhou, Christopher Reardon, Lynne E. Parker

In addition, the results show that our SOD descriptor is a superior individual descriptor for action recognition.

Action Recognition Temporal Action Localization

Paper
Add Code

Sparse Dictionary Learning for Edit Propagation of High-Resolution Images

no code implementations • CVPR 2014 • Xiaowu Chen, Dongqing Zou, Jianwei Li, Xiaochun Cao, Qinping Zhao, Hao Zhang

Previous approaches for edit propagation typically employ a global optimization over the whole set of image pixels, incurring a prohibitively high memory and time consumption for high-resolution images.

Dictionary Learning Vocal Bursts Intensity Prediction

Paper
Add Code

HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition

no code implementations • ICCV 2015 • Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis Decoste, Wei Di, Yizhou Yu

In this paper, we introduce hierarchical deep CNNs (HD-CNNs) by embedding deep CNNs into a category hierarchy.

Image Classification Object Recognition

Paper
Add Code

Enforcing Structural Diversity in Cube-pruned Dependency Parsing

no code implementations • ACL 2014 • Hao Zhang, Ryan Mcdonald

Prepositional Phrase Attachment

Paper
Add Code

Universal Dependency Annotation for Multilingual Parsing

no code implementations • ACL 2013 • Ryan McDonald, Joakim Nivre, Yvonne Quirmbach-Brundage, Yoav Goldberg, Dipanjan Das, Kuzman Ganchev, Keith Hall, Slav Petrov, Hao Zhang, Oscar T{\"a}ckstr{\"o}m, Claudia Bedini, N{\'u}ria Bertomeu Castell{\'o}, Jungmee Lee

Dependency Parsing

Paper
Add Code

Online Learning for Inexact Hypergraph Search

no code implementations • EMNLP 2013 • Hao Zhang, Liang Huang, Kai Zhao, Ryan Mcdonald

Constituency Parsing Structured Prediction

Paper
Add Code

Generalized Higher-Order Dependency Parsing with Cube Pruning

no code implementations • EMNLP 2012 • Hao Zhang, Ryan Mcdonald

Dependency Parsing

Paper
Add Code

KWB: An Automated Quick News System for Chinese Readers

no code implementations • WS 2015 • Yiqi Bai, Wenjing Yang, Hao Zhang, Jingwen Wang, Ming Jia, Rol Tong, , Jie Wang

Paper
Add Code

AdaCoSeg: Adaptive Shape Co-Segmentation with Group Consistency Loss

no code implementations • CVPR 2020 • Chenyang Zhu, Kai Xu, Siddhartha Chaudhuri, Li Yi, Leonidas Guibas, Hao Zhang

While the part prior network can be trained with noisy and inconsistently segmented shapes, the final output of AdaCoSeg is a consistent part labeling for the input set, with each shape segmented into up to (a user-specified) K parts.

Instance Segmentation Segmentation +1

Paper
Add Code

LOGAN: Unpaired Shape Transform in Latent Overcomplete Space

no code implementations • 25 Mar 2019 • Kangxue Yin, Zhiqin Chen, Hui Huang, Daniel Cohen-Or, Hao Zhang

Our network consists of an autoencoder to encode shapes from the two input domains into a common latent space, where the latent codes concatenate multi-scale shape features, resulting in an overcomplete representation.

Generative Adversarial Network Translation

Paper
Add Code

DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input

no code implementations • 29 Mar 2019 • Zihao Bo, Hao Zhang, Junhai Yong, Feng Xu

We propose a real-time DNN-based technique to segment hand and object of interacting motions from depth inputs.

Hand Segmentation Object +1

Paper
Add Code

Multisensory Omni-directional Long-term Place Recognition: Benchmark Dataset and Analysis

no code implementations • 18 Apr 2017 • Ashwin Mathur, Fei Han, Hao Zhang

We introduce a new dataset Multisensory Omnidirectional Long-term Place recognition (MOLP) comprising omnidirectional intensity and disparity images.

Robotics

Paper
Add Code

Constrained low-tubal-rank tensor recovery for hyperspectral images mixed noise removal by bilateral random projections

no code implementations • 15 May 2019 • Hao Zhang, Xi-Le Zhao, Tai-Xiang Jiang, Michael Kwok-Po Ng

In this paper, we propose a novel low-tubal-rank tensor recovery model, which directly constrains the tubal rank prior for effectively removing the mixed Gaussian and sparse noise in hyperspectral images.

Hyperspectral Image Denoising Image Denoising

Paper
Add Code

A Hybrid Precipitation Prediction Method based on Multicellular Gene Expression Programming

no code implementations • 1 Apr 2019 • Hongya Li, Yuzhong Peng, Chuyan Deng, Yonghua Pan, Daoqing Gong, Hao Zhang

Prompt and accurate precipitation forecast is very important for development management of regional water resource, flood disaster prevention and people's daily activity and production plan; however, non-linear and nonstationary characteristics of precipitation data and noise seriously affect forecast accuracy.

Denoising Management

Paper
Add Code

A Seft-adaptive Multicellular GEP Algorithm Based On Fuzzy Control For Function Optimization

no code implementations • 1 Apr 2019 • Chuyan Deng, Yuzhong Peng, Hongya Li, Daoqing Gong, Hao Zhang, Zhiping Liu

According to the concentration and dispersion of individual fitness values in population, the crossover rate, mutation rate and real number set mutation rate of genetic operation are dynamically adjusted.

Paper
Add Code

GRAINS: Generative Recursive Autoencoders for INdoor Scenes

no code implementations • 24 Jul 2018 • Manyi Li, Akshay Gadi Patil, Kai Xu, Siddhartha Chaudhuri, Owais Khan, Ariel Shamir, Changhe Tu, Baoquan Chen, Daniel Cohen-Or, Hao Zhang

We present a generative neural network which enables us to generate plausible 3D indoor scenes in large quantities and varieties, easily and highly efficiently.

Graphics

Paper
Add Code

Improving Performance of End-to-End ASR on Numeric Sequences

no code implementations • 1 Jul 2019 • Cal Peyser, Hao Zhang, Tara N. Sainath, Zelin Wu

This out-of-vocabulary (OOV) issue is addressed in conventional ASR systems by training part of the model on spoken domain utterances (e. g.

speech-recognition Speech Recognition

Paper
Add Code

Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition

no code implementations • ACL 2019 • Joey Tianyi Zhou, Hao Zhang, Di Jin, Hongyuan Zhu, Meng Fang, Rick Siow Mong Goh, Kenneth Kwok

We propose a new neural transfer method termed Dual Adversarial Transfer Network (DATNet) for addressing low-resource Named Entity Recognition (NER).

Language Modelling Low Resource Named Entity Recognition +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.