Search Results for author: Yan Wang

Found 330 papers, 125 papers with code

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

21 code implementations • 8 Feb 2021 • Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L. Yuille, Yuyin Zhou

Medical image segmentation is an essential prerequisite for developing healthcare systems, especially for disease diagnosis and treatment planning.

Ranked #4 on Medical Image Segmentation on ACDC

Cardiac Segmentation Decoder +4

8,278

Paper
Code

CogVLM: Visual Expert for Pretrained Language Models

1 code implementation • 6 Nov 2023 • Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang

We introduce CogVLM, a powerful open-source visual language foundation model.

Ranked #4 on Visual Question Answering (VQA) on InfiMM-Eval

Language Modelling Visual Question Answering

5,130

Paper
Code

CogAgent: A Visual Language Model for GUI Agents

1 code implementation • 14 Dec 2023 • Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

People are spending an enormous amount of time on digital devices through graphical user interfaces (GUIs), e.g., computer or smartphone screens.

Ranked #15 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

5,130

Paper
Code

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers

3 code implementations • 11 Oct 2023 • Jieneng Chen, Jieru Mei, Xianhang Li, Yongyi Lu, Qihang Yu, Qingyue Wei, Xiangde Luo, Yutong Xie, Ehsan Adeli, Yan Wang, Matthew Lungren, Lei Xing, Le Lu, Alan Yuille, Yuyin Zhou

In this paper, we extend the 2D TransUNet architecture to a 3D network by building upon the state-of-the-art nnU-Net architecture, and fully exploring Transformers' potential in both the encoder and decoder design.

Decoder Image Segmentation +4

2,155

Paper
Code

Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

2 code implementations • CVPR 2023 • Yunhao Bai, Duowen Chen, Qingli Li, Wei Shen, Yan Wang

In semi-supervised medical image segmentation, there exist empirical mismatch problems between labeled and unlabeled data distribution.

Ranked #3 on Semi-supervised Medical Image Segmentation on ACDC 5% labeled data

Image Segmentation Semi-supervised Medical Image Segmentation +1

2,007

Paper
Code

CogDL: A Comprehensive Library for Graph Deep Learning

1 code implementation • 1 Mar 2021 • Yukuo Cen, Zhenyu Hou, Yan Wang, Qibin Chen, Yizhen Luo, Zhongming Yu, Hengrui Zhang, Xingcheng Yao, Aohan Zeng, Shiguang Guo, Yuxiao Dong, Yang Yang, Peng Zhang, Guohao Dai, Yu Wang, Chang Zhou, Hongxia Yang, Jie Tang

In CogDL, we propose a unified design for the training and evaluation of GNN models for various graph tasks, making it unique among existing graph learning libraries.

Graph Classification Graph Embedding +5

1,687

Paper
Code

Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

2 code implementations • CVPR 2019 • Yan Wang, Wei-Lun Chao, Divyansh Garg, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

However, in this paper we argue that it is not the quality of the data but its representation that accounts for the majority of the difference.

Ranked #10 on 3D Object Detection From Stereo Images on KITTI Cars Moderate

3D Object Detection From Stereo Images Autonomous Driving +2

953

Paper
Code

SimpleShot: Revisiting Nearest-Neighbor Classification for Few-Shot Learning

6 code implementations • 12 Nov 2019 • Yan Wang, Wei-Lun Chao, Kilian Q. Weinberger, Laurens van der Maaten

Few-shot learners aim to recognize new object classes based on a small number of labeled training examples.

Ranked #4 on Few-Shot Image Classification on Dirichlet CUB-200 (5-way, 5-shot)

Few-Shot Image Classification Few-Shot Learning +1

913

Paper
Code

SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More

1 code implementation • 18 Apr 2023 • Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Zejian Li, Lingyun Sun, Papa Mao, Ying Zang

We can even outperform task-specific network models and achieve state-of-the-art performance in the tasks we tested: camouflaged object detection and shadow detection.

General Knowledge Image Segmentation +7

806

Paper
Code

PandaGPT: One Model To Instruction-Follow Them All

1 code implementation • 25 May 2023 • Yixuan Su, Tian Lan, Huayang Li, Jialu Xu, Yan Wang, Deng Cai

To do so, PandaGPT combines the multimodal encoders from ImageBind and the large language models from Vicuna.

Instruction Following

728

Paper
Code

PLUMENet: Efficient 3D Object Detection from Stereo Images

1 code implementation • 17 Jan 2021 • Yan Wang, Bin Yang, Rui Hu, Ming Liang, Raquel Urtasun

In this paper we propose a model that unifies these two tasks and performs them in the same metric space.

3D Object Detection From Stereo Images Depth Estimation +2

664

Paper
Code

An Experimental-based Review of Image Enhancement and Image Restoration Methods for Underwater Imaging

1 code implementation • 7 Jul 2019 • Yan Wang, Wei Song, Giancarlo Fortino, Lizhe Qi, Wenqiang Zhang, Antonio Liotta

Underwater images play a key role in ocean exploration, but often suffer from severe quality degradation due to light absorption and scattering in water medium.

Image Enhancement Image Restoration

618

Paper
Code

Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving

1 code implementation • ICLR 2020 • Yurong You, Yan Wang, Wei-Lun Chao, Divyansh Garg, Geoff Pleiss, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

In this paper we provide substantial advances to the pseudo-LiDAR framework through improvements in stereo depth estimation.

Ranked #7 on 3D Object Detection From Stereo Images on KITTI Cars Moderate

3D Object Detection From Stereo Images Autonomous Driving +2

587

Paper
Code

Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera

1 code implementation • 30 Jun 2022 • Hongrui Cai, Wanquan Feng, Xuetao Feng, Yan Wang, Juyong Zhang

We propose Neural-DynamicReconstruction (NDR), a template-free method to recover high-fidelity geometry and motions of a dynamic scene from a monocular RGB-D camera.

Dynamic Reconstruction Monocular Reconstruction +2

516

Paper
Code

Anytime Stereo Image Depth Estimation on Mobile Devices

3 code implementations • 26 Oct 2018 • Yan Wang, Zihang Lai, Gao Huang, Brian H. Wang, Laurens van der Maaten, Mark Campbell, Kilian Q. Weinberger

Many applications of stereo depth estimation in robotics require the generation of accurate disparity maps in real time under significant computational constraints.

Ranked #1 on Stereo Depth Estimation on KITTI2012

Stereo Depth Estimation

484

Paper
Code

A Contrastive Framework for Neural Text Generation

2 code implementations • 13 Feb 2022 • Yixuan Su, Tian Lan, Yan Wang, Dani Yogatama, Lingpeng Kong, Nigel Collier

Text generation is of great importance to many natural language processing applications.

Text Generation

445

Paper
Code

A Survey on Session-based Recommender Systems

1 code implementation • 13 Feb 2019 • Shoujin Wang, Longbing Cao, Yan Wang, Quan Z. Sheng, Mehmet Orgun, Defu Lian

In recent years, session-based recommender systems (SBRSs) have emerged as a new paradigm of RSs.

Collaborative Filtering Decision Making +1

379

Paper
Code

Language Models Can See: Plugging Visual Controls in Text Generation

1 code implementation • 5 May 2022 • Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier

MAGIC is a flexible framework and is theoretically compatible with any text generation tasks that incorporate image grounding.

Image Captioning Image-text matching +3

251

Paper
Code

HRank: Filter Pruning using High-Rank Feature Map

2 code implementations • CVPR 2020 • Mingbao Lin, Rongrong Ji, Yan Wang, Yichen Zhang, Baochang Zhang, Yonghong Tian, Ling Shao

The principle behind our pruning is that low-rank feature maps contain less information, and thus pruned results can be easily reproduced.

Network Pruning Vocal Bursts Intensity Prediction

246

Paper
Code

Intriguing Findings of Frequency Selection for Image Deblurring

2 code implementations • 23 Nov 2021 • Xintian Mao, Yiming Liu, Fengze Liu, Qingli Li, Wei Shen, Yan Wang

Blur was naturally analyzed in the frequency domain, by estimating the latent sharp image and the blur kernel given a blurry image.

Ranked #2 on Deblurring on RealBlur-R

Deblurring Image Deblurring +1

215

Paper
Code

An Embodied Generalist Agent in 3D World

1 code implementation • 18 Nov 2023 • Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

Leveraging massive knowledge and learning schemes from large language models (LLMs), recent machine learning models show notable successes in building generalist agents that exhibit the capability of general-purpose task solving in diverse domains, including natural language processing, computer vision, and robotics.

3D dense captioning Question Answering +3

207

Paper
Code

ISTR: End-to-End Instance Segmentation with Transformers

1 code implementation • 3 May 2021 • Jie Hu, Liujuan Cao, Yao Lu, Shengchuan Zhang, Yan Wang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji

However, such an upgrade is not applicable to instance segmentation, due to its significantly higher output dimensions compared to object detection.

Ranked #21 on Instance Segmentation on COCO test-dev

Instance Segmentation object-detection +3

200

Paper
Code

End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection

1 code implementation • CVPR 2020 • Rui Qian, Divyansh Garg, Yan Wang, Yurong You, Serge Belongie, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

Reliable and accurate 3D object detection is a necessity for safe autonomous driving.

3D Depth Estimation 3D Object Detection +3

186

Paper
Code

EasyTPP: Towards Open Benchmarking Temporal Point Processes

1 code implementation • 16 Jul 2023 • Siqiao Xue, Xiaoming Shi, Zhixuan Chu, Yan Wang, Hongyan Hao, Fan Zhou, Caigao Jiang, Chen Pan, James Y. Zhang, Qingsong Wen, Jun Zhou, Hongyuan Mei

In this paper, we present EasyTPP, the first central repository of research assets (e.g., data, models, evaluation programs, documentation) in the area of event sequence modeling.

Benchmarking Point Processes

180

Paper
Code

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

1 code implementation • 30 May 2023 • Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi

To address the DoT problem, we propose a Multi-Agent Debate (MAD) framework, in which multiple agents express their arguments in the state of "tit for tat" and a judge manages the debate process to obtain a final solution.

Arithmetic Reasoning Machine Translation

178

Paper
Code

Copy Is All You Need

1 code implementation • 13 Jul 2023 • Tian Lan, Deng Cai, Yan Wang, Heyan Huang, Xian-Ling Mao

The dominant text generation models compose the output by sequentially selecting words from a fixed vocabulary.

Domain Adaptation Language Modelling +1

177

Paper
Code

HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification

1 code implementation • 21 Sep 2022 • Xiangzuo Huo, Gang Sun, Shengwei Tian, Yan Wang, Long Yu, Jun Long, Wendong Zhang, Aolun Li

A parallel hierarchy of local and global feature blocks is designed to efficiently extract local features and global representations at various semantic scales, with the flexibility to model at different scales and linear computational complexity with respect to image size.

Image Classification Inductive Bias +1

158

Paper
Code

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

1 code implementation • 2 Sep 2021 • Yihuai Lan, Lei Wang, Qiyuan Zhang, Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim

Over the last few years, a growing number of datasets and deep learning-based methods have been proposed for effectively solving MWPs.

Ranked #8 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving

156

Paper
Code

RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

1 code implementation • 11 Sep 2021 • Shiyu Tang, Ruihao Gong, Yan Wang, Aishan Liu, Jiakai Wang, Xinyun Chen, Fengwei Yu, Xianglong Liu, Dawn Song, Alan Yuille, Philip H. S. Torr, DaCheng Tao

Thus, we propose RobustART, the first comprehensive Robustness investigation benchmark on ImageNet regarding ARchitecture design (49 human-designed off-the-shelf architectures and 1200+ networks from neural architecture search) and Training techniques (10+ techniques, e.g., data augmentation) towards diverse noises (adversarial, natural, and system noises).

Adversarial Robustness Benchmarking +2

143

Paper
Code

Robust Face Detection via Learning Small Faces on Hard Images

1 code implementation • 28 Nov 2018 • Zhishuai Zhang, Wei Shen, Siyuan Qiao, Yan Wang, Bo Wang, Alan Yuille

In this paper, we propose that the robustness of a face detector against hard faces can be improved by learning small faces on hard images.

Ranked #8 on Face Detection on WIDER Face (Hard)

Face Detection

139

Paper
Code

Temporal Convolutional Attention-based Network For Sequence Modeling

1 code implementation • 28 Feb 2020 • Hongyan Hao, Yan Wang, Siqiao Xue, Yudi Xia, Jian Zhao, Furao Shen

So we propose an exploratory architecture referred to as Temporal Convolutional Attention-based Network (TCAN), which combines a temporal convolutional network and an attention mechanism.

136

Paper
Code

BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data

1 code implementation • ACL 2021 • Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang, Ting Liu

Maintaining consistent personas is essential for dialogue agents.

Decoder Dialogue Generation +1

134

Paper
Code

CAMixerSR: Only Details Need More "Attention"

1 code implementation • 29 Feb 2024 • Yan Wang, Yi Liu, Shijie Zhao, Junlin Li, Li Zhang

To satisfy the rapidly increasing demands on the large image (2K-8K) super-resolution (SR), prevailing methods follow two independent tracks: 1) accelerate existing networks by content-aware routing, and 2) design better super-resolution networks via token mixer refining.

2k 8k +1

127

Paper
Code

Train in Germany, Test in The USA: Making 3D Object Detectors Generalize

1 code implementation • CVPR 2020 • Yan Wang, Xiangyu Chen, Yurong You, Li Erran, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

In the domain of autonomous driving, deep learning has substantially improved the 3D object detection accuracy for LiDAR and stereo camera data alike.

3D Object Detection Autonomous Driving +2

125

Paper
Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00 dB on DIV2K validation set.

Image Super-Resolution

117

Paper
Code

Automatic Prosody Annotation with Pre-Trained Text-Speech Model

1 code implementation • 16 Jun 2022 • Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu

Prosodic boundary plays an important role in text-to-speech synthesis (TTS) in terms of naturalness and readability.

Speech Synthesis Text-To-Speech Synthesis

108

Paper
Code

Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation

2 code implementations • CVPR 2018 • Qihang Yu, Lingxi Xie, Yan Wang, Yuyin Zhou, Elliot K. Fishman, Alan L. Yuille

The key innovation is a saliency transformation module, which repeatedly converts the segmentation probability map from the previous iteration as spatial weights and applies these weights to the current iteration.

Ranked #1 on Pancreas Segmentation on TCIA Pancreas-CT Dataset

Organ Segmentation Pancreas Segmentation +1

105

Paper
Code

A Fixed-Point Model for Pancreas Segmentation in Abdominal CT Scans

3 code implementations • 25 Dec 2016 • Yuyin Zhou, Lingxi Xie, Wei Shen, Yan Wang, Elliot K. Fishman, Alan L. Yuille

Deep neural networks have been widely adopted for automatic organ segmentation from abdominal CT scans.

Organ Segmentation Pancreas Segmentation +1

105

Paper
Code

Wasserstein Distances for Stereo Disparity Estimation

1 code implementation • NeurIPS 2020 • Divyansh Garg, Yan Wang, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values.

Ranked #2 on Stereo Depth Estimation on KITTI2015 (three pixel error metric)

3D Object Detection From Stereo Images Autonomous Driving +5

103

Paper
Code

Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters

1 code implementation • 13 Nov 2022 • Nuo Chen, Yan Wang, Haiyun Jiang, Deng Cai, Yuhan Li, Ziyang Chen, Longyue Wang, Jia Li

In this paper, we introduce the Harry Potter Dialogue (HPD) dataset, designed to advance the study of dialogue agents and character alignment.

Ranked #1 on Persona Dialogue in Story on Harry Potter Dialogue Dataset

Dialogue Generation In-Context Learning +2

Paper
Code

ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding

5 code implementations • CVPR 2022 • Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, Yan Wang

Recently, learned image compression techniques have achieved remarkable performance, even surpassing the best manually designed lossy image coders.

Ranked #1 on Image Compression on kodak

Image Compression

Paper
Code

Multi-scale Attention Network for Single Image Super-Resolution

1 code implementation • 28 Sep 2022 • Yan Wang, Yusen Li, Gang Wang, Xiaoguang Liu

ConvNets can compete with transformers in high-level tasks by exploiting larger receptive fields.

Blocking Image Super-Resolution +1

Paper
Code

Meta Architecture for Point Cloud Analysis

1 code implementation • CVPR 2023 • Haojia Lin, Xiawu Zheng, Lijiang Li, Fei Chao, Shanshan Wang, Yan Wang, Yonghong Tian, Rongrong Ji

However, the lack of a unified framework to interpret those networks makes any systematic comparison, contrast, or analysis challenging, and practically limits healthy development of the field.

Ranked #2 on 3D Semantic Segmentation on OpenTrench3D

3D Semantic Segmentation

Paper
Code

Neural Machine Translation with Monolingual Translation Memory

1 code implementation • ACL 2021 • Deng Cai, Yan Wang, Huayang Li, Wai Lam, Lemao Liu

Second, the memory retriever and NMT model can be jointly optimized for the ultimate translation goal.

Domain Adaptation Machine Translation +3

Paper
Code

Rotated Binary Neural Network

2 code implementations • NeurIPS 2020 • Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Yan Wang, Yongjian Wu, Feiyue Huang, Chia-Wen Lin

In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version.

Binarization Quantization

Paper
Code

Graph-to-Tree Learning for Solving Math Word Problems

1 code implementation • ACL 2020 • Jipeng Zhang, Lei Wang, Roy Ka-Wei Lee, Yi Bin, Yan Wang, Jie Shao, Ee-Peng Lim

While recent tree-based neural models have demonstrated promising results in generating solution expressions for math word problems (MWPs), most of these models do not capture the relationships and order information among the quantities well.

Ranked #10 on Math Word Problem Solving on Math23K

Decoder Math +1

Paper
Code

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem

1 code implementation • 2 Oct 2022 • Xingyu Chen, Ruonan Zhang, Ji Jiang, Yan Wang, Ge Li, Thomas H. Li

In this paper, we redesign the patch-based triplet loss in MDE to alleviate the ubiquitous edge-fattening issue.

Ranked #1 on Unsupervised Monocular Depth Estimation on Kitti Raw

Depth Prediction Metric Learning +2

Paper
Code

Resource Aware Person Re-identification across Multiple Resolutions

1 code implementation • CVPR 2018 • Yan Wang, Lequn Wang, Yurong You, Xu Zou, Vincent Chen, Serena Li, Gao Huang, Bharath Hariharan, Kilian Q. Weinberger

Not all people are equally easy to identify: color statistics might be enough for some cases while others might require careful reasoning about high- and low-level details.

Ranked #12 on Person Re-Identification on CUHK03 detected

Person Re-Identification

Paper
Code

DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images

1 code implementation • 13 Sep 2016 • Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Xiang Bai, Alan Yuille

By observing the relationship between the receptive field sizes of the different layers in the network and the skeleton scales they can capture, we introduce two scale-associated side outputs to each stage of the network.

Multi-Task Learning Object +3

Paper
Code

Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

1 code implementation • 7 Nov 2023 • Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

We propose a novel framework to integrate SD maps into online map prediction and propose a Transformer-based encoder, SD Map Encoder Representations from transFormers, to leverage priors in SD maps for the lane-topology prediction task.

Autonomous Driving Lane Detection

Paper
Code

Non-Autoregressive Text Generation with Pre-trained Language Models

1 code implementation • EACL 2021 • Yixuan Su, Deng Cai, Yan Wang, David Vandyke, Simon Baker, Piji Li, Nigel Collier

In this work, we show that BERT can be employed as the backbone of a NAG model to greatly improve performance.

Machine Translation Sentence +3

Paper
Code

Edge-enhanced Feature Distillation Network for Efficient Super-Resolution

1 code implementation • 19 Apr 2022 • Yan Wang

With the recent massive development in convolutional neural networks, numerous lightweight CNN-based image super-resolution methods have been proposed for practical deployments on edge devices.

Image Super-Resolution

Paper
Code

SpecTr: Spectral Transformer for Hyperspectral Pathology Image Segmentation

1 code implementation • 5 Mar 2021 • Boxiang Yun, Yan Wang, Jieneng Chen, Huiyu Wang, Wei Shen, Qingli Li

Hyperspectral imaging (HSI) unlocks huge potential for a wide variety of applications that rely on high-precision pathology image segmentation, such as computational pathology and precision medicine.

Image Segmentation Segmentation +1

Paper
Code

LDLS: 3-D Object Segmentation Through Label Diffusion From 2-D Images

1 code implementation • 30 Oct 2019 • Brian H. Wang, Wei-Lun Chao, Yan Wang, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

We obtain 2-D segmentation predictions by applying Mask-RCNN to the RGB image, and then link this image to a 3-D lidar point cloud by building a graph of connections among 3-D points and 2-D pixels.

Image Segmentation Point Cloud Segmentation +2

Paper
Code

Learning Efficient GANs for Image Translation via Differentiable Masks and co-Attention Distillation

1 code implementation • 17 Nov 2020 • Shaojie Li, Mingbao Lin, Yan Wang, Fei Chao, Ling Shao, Rongrong Ji

The latter simultaneously distills informative attention maps from both the generator and discriminator of a pre-trained model to the searched generator, effectively stabilizing the adversarial training of our light-weight model.

Translation

Paper
Code

VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object Detection

2 code implementations • 20 Mar 2023 • Zhe Wang, Siqi Fan, Xiaoliang Huo, Tongda Xu, Yan Wang, Jingjing Liu, Yilun Chen, Ya-Qin Zhang

In autonomous driving, Vehicle-Infrastructure Cooperative 3D Object Detection (VIC3D) makes use of multi-view cameras from both vehicles and traffic infrastructure, providing a global vantage point with rich semantic context of road conditions beyond a single vehicle viewpoint.

3D Object Detection Autonomous Driving +2

Paper
Code

EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection

2 code implementations • 23 Feb 2024 • Zhe Wang, Siqi Fan, Xiaoliang Huo, Tongda Xu, Yan Wang, Jingjing Liu, Yilun Chen, Ya-Qin Zhang

In autonomous driving, cooperative perception makes use of multi-view cameras from both vehicles and infrastructure, providing a global vantage point with rich semantic context of road conditions beyond a single vehicle viewpoint.

3D Object Detection Autonomous Driving +2

Paper
Code

Checkerboard Context Model for Efficient Learned Image Compression

3 code implementations • CVPR 2021 • Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin

To the best of our knowledge, this is the first exploration of a parallelization-friendly spatial context model for learned image compression.

Computational Efficiency Image Compression

Paper
Code

Inter-slice Context Residual Learning for 3D Medical Image Segmentation

1 code implementation • 28 Nov 2020 • Jianpeng Zhang, Yutong Xie, Yan Wang, Yong Xia

In this paper, we propose the 3D context residual network (ConResNet) for the accurate segmentation of 3D medical images.

Brain Tumor Segmentation Decoder +4

Paper
Code

DDPNAS: Efficient Neural Architecture Search via Dynamic Distribution Pruning

1 code implementation • 28 May 2019 • Xiawu Zheng, Chenyi Yang, Shaokun Zhang, Yan Wang, Baochang Zhang, Yongjian Wu, Yunsheng Wu, Ling Shao, Rongrong Ji

With the proposed efficient network generation method, we directly obtain the optimal neural architectures on given constraints, which is practical for on-device models across diverse search spaces and constraints.

Neural Architecture Search

Paper
Code

Profile Consistency Identification for Open-domain Dialogue Agents

1 code implementation • EMNLP 2020 • Haoyu Song, Yan Wang, Wei-Nan Zhang, Zhengyu Zhao, Ting Liu, Xiaojiang Liu

Maintaining a consistent attribute profile is crucial for dialogue agents to naturally converse with humans.

Attribute

Paper
Code

You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient

1 code implementation • 4 Jun 2021 • Shaokun Zhang, Xiawu Zheng, Chenyi Yang, Yuchao Li, Yan Wang, Fei Chao, Mengdi Wang, Shen Li, Jun Yang, Rongrong Ji

Motivated by the necessity of efficient inference across various constraints on BERT, we propose a novel approach, YOCO-BERT, to achieve compress once and deploy everywhere.

AutoML Model Compression

Paper
Code

Deep Regression Forests for Age Estimation

2 code implementations • CVPR 2018 • Wei Shen, Yilu Guo, Yan Wang, Kai Zhao, Bo Wang, Alan Yuille

Age estimation from facial images is typically cast as a nonlinear regression problem.

Ranked #6 on Age Estimation on FGNET

Age Estimation regression

Paper
Code

Translating a Math Word Problem to an Expression Tree

1 code implementation • 14 Nov 2018 • Lei Wang, Yan Wang, Deng Cai, Dongxiang Zhang, Xiaojiang Liu

Moreover, we analyze the performance of three popular SEQ2SEQ models on math word problem solving.

Math Math Word Problem Solving

Paper
Code

MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery

1 code implementation • CVPR 2023 • Duowen Chen, Yunhao Bai, Wei Shen, Qingli Li, Lequan Yu, Yan Wang

Our strategy encourages unlabeled images to learn organ semantics in relative locations from the labeled images (cross-branch) and enhances the learning ability for small organs (within-branch).

Anatomy Data Augmentation +4

Paper
Code

SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud

1 code implementation • 6 Dec 2022 • Yan Wang, Junbo Yin, Wei Li, Pascal Frossard, Ruigang Yang, Jianbing Shen

However, these UDA solutions yield unsatisfactory 3D detection results when there is a severe domain shift, e.g., from Waymo (64-beam) to nuScenes (32-beam).

3D Object Detection Autonomous Driving +5

Paper
Code

Fixed Neural Network Steganography: Train the images, not the network

1 code implementation • ICLR 2022 • Varsha Kishore, Xiangyu Chen, Yan Wang, Boyi Li, Kilian Q Weinberger

Recent attempts at image steganography make use of advances in deep learning to train an encoder-decoder network pair to hide and retrieve secret messages in images.

Decoder Image Steganography +1

Paper
Code

Exploring Dense Retrieval for Dialogue Response Selection

1 code implementation • 13 Oct 2021 • Tian Lan, Deng Cai, Yan Wang, Yixuan Su, Heyan Huang, Xian-Ling Mao

In this study, we present a solution to directly select proper responses from a large corpus or even a nonparallel corpus that only consists of unpaired sentences, using a dense retrieval model.

Conversational Response Selection Retrieval

Paper
Code

Kairos: Practical Intrusion Detection and Investigation using Whole-system Provenance

1 code implementation • 9 Aug 2023 • Zijun Cheng, Qiujian Lv, Jinyuan Liang, Yan Wang, Degang Sun, Thomas Pasquier, Xueyuan Han

Sifting through their design documents, we identify four common dimensions that drive the development of provenance-based intrusion detection systems (PIDSes): scope (can PIDSes detect modern attacks that infiltrate across application boundaries?

Decoder Intrusion Detection

Paper
Code

ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks

1 code implementation • 16 Nov 2023 • Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Liang Chen, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein

While Large Language Models (LLMs) have demonstrated proficiency in code generation benchmarks, translating these results into practical development scenarios - where leveraging existing repository-level libraries is the norm - remains challenging.

Code Generation Navigate

Paper
Code

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models

1 code implementation • 10 Feb 2023 • Yang Liu, Dingkang Yang, Yan Wang, Jing Liu, Jun Liu, Azzedine Boukerche, Peng Sun, Liang Song

Video Anomaly Detection (VAD) serves as a pivotal technology in intelligent surveillance systems, enabling the temporal or spatial identification of anomalous events within videos.

Anomaly Detection Event Detection +1

Paper
Code

Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding

1 code implementation • 27 Mar 2024 • Zhiheng Cheng, Qingyue Wei, Hongru Zhu, Yan Wang, Liangqiong Qu, Wei Shao, Yuyin Zhou

This paper introduces H-SAM: a prompt-free adaptation of SAM tailored for efficient fine-tuning of medical images via a two-stage hierarchical decoding procedure.

Decoder Image Segmentation +4

Paper
Code

A Dynamic Model Identification Package for the da Vinci Research Kit

1 code implementation • 28 Feb 2019 • Yan Wang, Radian Gondokaryono, Adnan Munawar, Gregory S Fischer

We developed a dynamic model identification package for the dVRK, capable of modeling the parallelograms, springs, counterweight, and tendon couplings, which are inherent to the dVRK.

Robotics

Paper
Code

ContrastMask: Contrastive Learning to Segment Every Thing

1 code implementation • CVPR 2022 • Xuehui Wang, Kai Zhao, Ruixin Zhang, Shouhong Ding, Yan Wang, Wei Shen

In this framework, annotated masks of seen categories and pseudo masks of unseen categories serve as a prior for contrastive learning, where features from the mask regions (foreground) are pulled together, and are contrasted against those from the background, and vice versa.

Instance Segmentation Segmentation +1

Paper
Code

Distilling a Powerful Student Model via Online Knowledge Distillation

1 code implementation • 26 Mar 2021 • Shaojie Li, Mingbao Lin, Yan Wang, Yongjian Wu, Yonghong Tian, Ling Shao, Rongrong Ji

Besides, a self-distillation module is adopted to convert the feature map of deeper layers into a shallower one.

Knowledge Distillation

Paper
Code

Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction

1 code implementation • 20 Aug 2023 • Zeyu Han, YuHan Wang, Luping Zhou, Peng Wang, Binyu Yan, Jiliu Zhou, Yan Wang, Dinggang Shen

To obtain high-quality positron emission tomography (PET) scans while reducing radiation exposure to the human body, various approaches have been proposed to reconstruct standard-dose PET (SPET) images from low-dose PET (LPET) images.

Paper
Code

Weakly-Supervised Salient Object Detection Using Point Supervision

1 code implementation • 22 Mar 2022 • Shuyong Gao, Wei zhang, Yan Wang, Qianyu Guo, Chenglong Zhang, Yangji He, Wenqiang Zhang

Then we develop a transformer-based point-supervised saliency detection model to produce the first round of saliency maps.

Object object-detection +3

Paper
Code

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception

1 code implementation • ICCV 2023 • Dingkang Yang, Shuai Huang, Zhi Xu, Zhenpeng Li, Shunli Wang, Mingcheng Li, Yuzheng Wang, Yang Liu, Kun Yang, Zhaoyu Chen, Yan Wang, Jing Liu, Peixuan Zhang, Peng Zhai, Lihua Zhang

Driver distraction has become a significant cause of severe traffic accidents over the past decade.

Paper
Code

Robust active flow control over a range of Reynolds numbers using an artificial neural network trained through deep reinforcement learning

1 code implementation • 26 Apr 2020 • Hongwei Tang, Jean Rabault, Alexander Kuhnle, Yan Wang, Tongguang Wang

This paper focuses on the active flow control of a computational fluid dynamics simulation over a range of Reynolds numbers using deep reinforcement learning (DRL).

Fluid Dynamics

Paper
Code

Calibration-free BEV Representation for Infrastructure Perception

1 code implementation • 7 Mar 2023 • Siqi Fan, Zhe Wang, Xiaoliang Huo, Yan Wang, Jingjing Liu

Effective BEV object detection on infrastructure can greatly improve traffic scene understanding and vehicle-to-infrastructure (V2I) cooperative perception.

Ranked #5 on 3D Object Detection on DAIR-V2X-I

3D Object Detection object-detection

Paper
Code

OMPQ: Orthogonal Mixed Precision Quantization

1 code implementation • 16 Sep 2021 • Yuexiao Ma, Taisong Jin, Xiawu Zheng, Yan Wang, Huixia Li, Yongjian Wu, Guannan Jiang, Wei zhang, Rongrong Ji

Instead of solving the original integer programming problem, we propose to optimize a proxy metric, the concept of network orthogonality, which is highly correlated with the loss of the integer programming but also easy to optimize with linear programming.

AutoML Quantization

Paper
Code

Skeleton-to-Response: Dialogue Generation Guided by Retrieval Memory

1 code implementation • NAACL 2019 • Deng Cai, Yan Wang, Victoria Bi, Zhaopeng Tu, Xiaojiang Liu, Wai Lam, Shuming Shi

Such models rely on insufficient information for generating a specific response since a certain query could be answered in multiple ways.

Dialogue Generation Information Retrieval +3

Paper
Code

Dialogue Response Selection with Hierarchical Curriculum Learning

1 code implementation • ACL 2021 • Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel Collier, Yan Wang

As for IC, it progressively strengthens the model's ability to identify the mismatching information between the dialogue context and a response candidate.

Ranked #3 on Conversational Response Selection on RRS

Conversational Response Selection

Paper
Code

Network Pruning using Adaptive Exemplar Filters

1 code implementation • 20 Jan 2021 • Mingbao Lin, Rongrong Ji, Shaojie Li, Yan Wang, Yongjian Wu, Feiyue Huang, Qixiang Ye

Inspired by the face recognition community, we use a message passing algorithm Affinity Propagation on the weight matrices to obtain an adaptive number of exemplars, which then act as the preserved filters.

Face Recognition Network Pruning

Paper
Code

Uncertainty Quantification in Machine Learning for Engineering Design and Health Prognostics: A Tutorial

1 code implementation • 7 May 2023 • Venkat Nemani, Luca Biggio, Xun Huan, Zhen Hu, Olga Fink, Anh Tran, Yan Wang, Xiaoge Zhang, Chao Hu

In this tutorial, we aim to provide a holistic lens on emerging UQ methods for ML models with a particular focus on neural networks and the applications of these UQ methods in tackling engineering design as well as prognostics and health management problems.

Decision Making Management +2

Paper
Code

Idempotence and Perceptual Image Compression

1 code implementation • 17 Jan 2024 • Tongda Xu, Ziran Zhu, Dailan He, Yanghao Li, Lina Guo, Yuanyuan Wang, Zhe Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

However, we find that theoretically: 1) Conditional generative model-based perceptual codec satisfies idempotence; 2) Unconditional generative model with idempotence constraint is equivalent to conditional generative codec.

Image Compression

Paper
Code

A Correlation Information-based Spatiotemporal Network for Traffic Flow Forecasting

2 code implementations • 20 May 2022 • Weiguo Zhu, Yongqi Sun, Xintong Yi, Yan Wang

In this paper, based on the maximal information coefficient, we present two elaborate spatiotemporal representations, spatial correlation information (SCorr) and temporal correlation information (TCorr).

Ranked #1 on Traffic Prediction on HZME(outflow)

Traffic Prediction

Paper
Code

GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

1 code implementation • 13 Mar 2024 • Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhang

In response, we propose a groundbreaking paradigm of image representation and compression by 2D Gaussian Splatting, named GaussianImage.

Quantization

Paper
Code

Spectral Network Embedding: A Fast and Scalable Method via Sparsity

1 code implementation • 7 Jun 2018 • Jie Zhang, Yan Wang, Jie Tang, Ming Ding

In this paper, we propose a $10\times \sim 100\times$ faster network embedding method, called Progle, by elegantly utilizing the sparsity property of online networks and spectral analysis.

Link Prediction Network Embedding +1

Paper
Code

PO-ELIC: Perception-Oriented Efficient Learned Image Coding

1 code implementation • 28 May 2022 • Dailan He, Ziming Yang, Hongjiu Yu, Tongda Xu, Jixiang Luo, Yuan Chen, Chenjian Gao, Xinjie Shi, Hongwei Qin, Yan Wang

In the past years, learned image compression (LIC) has achieved remarkable performance.

Image Compression MS-SSIM +1

Paper
Code

A Unified Framework for 3D Point Cloud Visual Grounding

1 code implementation • 23 Aug 2023 • Haojia Lin, Yongdong Luo, Xiawu Zheng, Lijiang Li, Fei Chao, Taisong Jin, Donghao Luo, Yan Wang, Liujuan Cao, Rongrong Ji

This elaborate design enables 3DRefTR to achieve both well-performing 3DRES and 3DREC capacities with only a 6% additional latency compared to the original 3DREC model.

Referring Expression Referring Expression Comprehension +1

Paper
Code

Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks

1 code implementation • 16 Apr 2022 • Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji

Despite the exciting performance, Transformer is criticized for its excessive parameters and computation cost.

Image Classification

Paper
Code

Unsupervised Domain Adaptation through Shape Modeling for Medical Image Segmentation

1 code implementation • 6 Jul 2022 • Yuan YAO, Fengze Liu, Zongwei Zhou, Yan Wang, Wei Shen, Alan Yuille, Yongyi Lu

Previous methods proposed Variational Autoencoder (VAE) based models to learn the distribution of shape for a particular organ and used it to automatically evaluate the quality of a segmentation prediction by fitting it into the learned shape distribution.

Image Segmentation Pancreas Segmentation +3

Paper
Code

Bit Allocation using Optimization

1 code implementation • 20 Sep 2022 • Tongda Xu, Han Gao, Chenjian Gao, Yuanyuan Wang, Dailan He, Jinyong Pi, Jixiang Luo, Ziyu Zhu, Mao Ye, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

In this paper, we consider the problem of bit allocation in Neural Video Compression (NVC).

Variational Inference Video Compression

Paper
Code

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle

1 code implementation • ICCV 2023 • Song Guo, Lei Zhang, Xiawu Zheng, Yan Wang, Yuchao Li, Fei Chao, Chenglin Wu, Shengchuan Zhang, Rongrong Ji

In this paper, we try to solve this problem by introducing a principled and unified framework based on Information Bottleneck (IB) theory, which further guides us to an automatic pruning approach.

Network Pruning

Paper
Code

DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution

1 code implementation • 22 Dec 2023 • Yan Wang, Tongtong Su, Yusen Li, Jiuwen Cao, Gang Wang, Xiaoguang Liu

Specifically, we propose a plug-in reparameterized dynamic unit (RDU) to promote the performance and inference cost trade-off.

Image Super-Resolution

Paper
Code

Boosting Neural Representations for Videos with a Conditional Decoder

1 code implementation • 28 Feb 2024 • Xinjie Zhang, Ren Yang, Dailan He, Xingtong Ge, Tongda Xu, Yan Wang, Hongwei Qin, Jun Zhang

Implicit neural representations (INRs) have emerged as a promising approach for video storage and processing, showing remarkable versatility across various video tasks.

Decoder

Paper
Code

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

1 code implementation • 16 Apr 2024 • Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi

In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.

Image Super-Resolution

Paper
Code

Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions

1 code implementation • ACL 2019 • Jierui Li, Lei Wang, Jipeng Zhang, Yan Wang, Bing Tian Dai, Dongxiang Zhang

Several deep learning models have been proposed for solving math word problems (MWPs) automatically.

Ranked #13 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving +1

Paper
Code

Cross-dataset Training for Class Increasing Object Detection

1 code implementation • 14 Jan 2020 • Yongqiang Yao, Yan Wang, Yu Guo, Jiaojiao Lin, Hongwei Qin, Junjie Yan

Given two or more already labeled datasets that target different object classes, cross-dataset training aims to detect the union of the different classes, so that we do not have to label all the classes for all the datasets.

Object object-detection +1

Paper
Code

Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation

1 code implementation • ICCV 2021 • Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim

To integrate the strengths of the two classifiers, we apply the well-established co-training framework, in which the two classifiers exchange their high-confidence predictions to iteratively "teach each other" so that both classifiers can excel in the target domain.

Semi-supervised Domain Adaptation Unsupervised Domain Adaptation

Paper
Code

Transductive Learning for Unsupervised Text Style Transfer

1 code implementation • EMNLP 2021 • Fei Xiao, Liang Pang, Yanyan Lan, Yan Wang, HuaWei Shen, Xueqi Cheng

The proposed transductive learning approach is general and effective for the task of unsupervised style transfer, and we will apply it to the other two typical methods in the future.

Decoder Retrieval +4

Paper
Code

PepLand: a large-scale pre-trained peptide representation model for a comprehensive landscape of both canonical and non-canonical amino acids

1 code implementation • 8 Nov 2023 • Ruochi Zhang, Haoran Wu, Yuting Xiu, Kewei Li, Ningning Chen, Yu Wang, Yan Wang, Xin Gao, Fengfeng Zhou

In recent years, the scientific community has become increasingly interested in peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation.

Paper
Code

Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers

2 code implementations • 5 Oct 2022 • Yan Wang, Gautham Vasan, A. Rupam Mahmood

A common setup for a robotic agent is to have two different computers simultaneously: a resource-limited local computer tethered to the robot and a powerful remote computer connected wirelessly.

Reinforcement Learning (RL)

Paper
Code

Large Language Models for Intent-Driven Session Recommendations

1 code implementation • 7 Dec 2023 • Zhu Sun, Hongyang Liu, Xinghua Qu, Kaidong Feng, Yan Wang, Yew-Soon Ong

Intent-aware session recommendation (ISR) is pivotal in discerning user intents within sessions for precise predictions.

Paper
Code

RoT: Enhancing Large Language Models with Reflection on Search Trees

1 code implementation • 8 Apr 2024 • Wenyang Hui, Chengyue Jiang, Yan Wang, Kewei Tu

It uses a strong LLM to summarize guidelines from previous tree search experiences to enhance the ability of a weak LLM.

Paper
Code

RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN

1 code implementation • 22 Aug 2022 • Huy Phan, Cong Shi, Yi Xie, Tianfang Zhang, Zhuohang Li, Tianming Zhao, Jian Liu, Yan Wang, Yingying Chen, Bo Yuan

Recently, backdoor attacks have become an emerging threat to the security of deep neural network (DNN) models.

Backdoor Attack

Paper
Code

SpiderMesh: Spatial-aware Demand-guided Recursive Meshing for RGB-T Semantic Segmentation

1 code implementation • 15 Mar 2023 • Siqi Fan, Zhe Wang, Yan Wang, Jingjing Liu

For semantic segmentation in urban scene understanding, RGB cameras alone often fail to capture a clear holistic topology in challenging lighting conditions.

Ranked #8 on Thermal Image Segmentation on PST900

Data Augmentation Segmentation +2

Paper
Code

Image2Points:A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction

1 code implementation • 1 Feb 2024 • Jiaqi Cui, Yan Wang, Lu Wen, Pinxian Zeng, Xi Wu, Jiliu Zhou, Dinggang Shen

To obtain high-quality positron emission tomography (PET) images while minimizing radiation exposure, numerous methods have been proposed to reconstruct standard-dose PET (SPET) images from the corresponding low-dose PET (LPET) images.

Image Reconstruction

Paper
Code

Graph Learning based Recommender Systems: A Review

1 code implementation • 13 May 2021 • Shoujin Wang, Liang Hu, Yan Wang, Xiangnan He, Quan Z. Sheng, Mehmet A. Orgun, Longbing Cao, Francesco Ricci, Philip S. Yu

Recent years have witnessed the fast development of the emerging topic of Graph Learning based Recommender Systems (GLRS).

Collaborative Filtering Graph Learning +1

Paper
Code

The Medical Segmentation Decathlon

1 code implementation • 10 Jun 2021 • Michela Antonelli, Annika Reinke, Spyridon Bakas, Keyvan Farahani, Annette Kopp-Schneider, Bennett A. Landman, Geert Litjens, Bjoern Menze, Olaf Ronneberger, Ronald M. Summers, Bram van Ginneken, Michel Bilello, Patrick Bilic, Patrick F. Christ, Richard K. G. Do, Marc J. Gollub, Stephan H. Heckers, William R. Jarnagin, Maureen K. McHugo, Sandy Napel, Jennifer S. Goli Pernicka, Kawal Rhode, Catalina Tobon-Gomez, Eugene Vorontsov, Henkjan Huisman, James A. Meakin, Sebastien Ourselin, Manuel Wiesenfarth, Pablo Arbelaez, Byeonguk Bae, Sihong Chen, Laura Daza, Jianjiang Feng, Baochun He, Fabian Isensee, Yuanfeng Ji, Fucang Jia, Namkug Kim, Ildoo Kim, Dorit Merhof, Akshay Pai, Beomhee Park, Mathias Perslev, Ramin Rezaiifar, Oliver Rippel, Ignacio Sarasua, Wei Shen, Jaemin Son, Christian Wachinger, Liansheng Wang, Yan Wang, Yingda Xia, Daguang Xu, Zhanwei Xu, Yefeng Zheng, Amber L. Simpson, Lena Maier-Hein, M. Jorge Cardoso

Segmentation is so far the most widely investigated medical image processing task, but the various segmentation challenges have typically been organized in isolation, such that algorithm development was driven by the need to tackle a single specific clinical problem.

Image Segmentation Segmentation +1

Paper
Code

A Counterfactual Collaborative Session-based Recommender System

1 code implementation • 31 Jan 2023 • Wenzhuo Song, Shoujin Wang, Yan Wang, Kunpeng Liu, Xueyan Liu, Minghao Yin

Next, COCO-SBRS adopts counterfactual inference to recommend items based on the outputs of the pre-trained recommendation model considering the causalities to alleviate the data sparsity problem.

counterfactual Counterfactual Inference +1

Paper
Code

Sketch and Customize: A Counterfactual Story Generator

1 code implementation • 2 Apr 2021 • Changying Hao, Liang Pang, Yanyan Lan, Yan Wang, Jiafeng Guo, Xueqi Cheng

In the sketch stage, a skeleton is extracted by removing words that conflict with the counterfactual condition from the original ending.

counterfactual Text Generation

Paper
Code

Deep Learning Analysis and Age Prediction from Shoeprints

1 code implementation • 7 Nov 2020 • Muhammad Hassan, Yan Wang, Di Wang, Daixi Li, Yanchun Liang, You Zhou, Dong Xu

We collected 100,000 shoeprints of subjects ranging from 7 to 80 years old and used the data to develop a deep learning end-to-end model ShoeNet to analyze age-related patterns and predict age.

Gender Classification

Paper
Code

A Fast Divide-and-Conquer Sparse Cox Regression

2 code implementations • 2 Apr 2018 • Yan Wang, Nathan Palmer, Qian Di, Joel Schwartz, Isaac Kohane, Tianxi Cai

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$.

Computation Applications

Paper
Code

Approximation of Images via Generalized Higher Order Singular Value Decomposition over Finite-dimensional Commutative Semisimple Algebra

1 code implementation • 1 Feb 2022 • Liang Liao, Sen Lin, Lun Li, Xiuwei Zhang, Song Zhao, Yan Wang, Xinqiang Wang, Qi Gao, Jingyu Wang

Higher order singular value decomposition (HOSVD) extends the SVD and can approximate higher order data using sums of a few rank-one components.

Paper
Code

Experience-Based Evolutionary Algorithms for Expensive Optimization

1 code implementation • 9 Apr 2023 • Xunzhao Yu, Yan Wang, Ling Zhu, Dimitar Filev, Xin Yao

Our experimental results on expensive multi-objective and constrained optimization problems demonstrate that experiences gained from related tasks help save evaluation budget on the target problem.

Evolutionary Algorithms Meta-Learning

Paper
Code

Enhancing Psychological Counseling with Large Language Model: A Multifaceted Decision-Support System for Non-Professionals

1 code implementation • 29 Aug 2023 • Guanghui Fu, Qing Zhao, Jianqiang Li, Dan Luo, Changwei Song, Wei Zhai, Shuo Liu, Fan Wang, Yan Wang, Lijuan Cheng, Juan Zhang, Bing Xiang Yang

In the contemporary landscape of social media, an alarming number of users express negative emotions, some of which manifest as strong suicidal intentions.

Language Modelling Large Language Model

Paper
Code

Deep Learning with Information Fusion and Model Interpretation for Health Monitoring of Fetus based on Long-term Prenatal Electronic Fetal Heart Rate Monitoring Data

1 code implementation • 27 Jan 2024 • Zenghui Lin, Xintong Liu, Nan Wang, Ruichen Li, Qingao Liu, Jingying Ma, LiWei Wang, Yan Wang, Shenda Hong

This kind of continuous monitoring, in contrast to the short-term one, collects an extended period of fetal heart data.

Specificity

Paper
Code

A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

1 code implementation • 20 Feb 2024 • Nailei Hei, Qianyu Guo, ZiHao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang

To bridge the distribution gap between user input behavior and model training datasets, we first construct a novel Coarse-Fine Granularity Prompts dataset (CFP) and propose a novel User-Friendly Fine-Grained Text Generation framework (UF-FGTG) for automated prompt optimization.

Image Generation Prompt Engineering +1

Paper
Code

A Powerful Generative Model Using Random Weights for the Deep Image Representation

1 code implementation • NeurIPS 2016 • Kun He, Yan Wang, John Hopcroft

To our knowledge this is the first demonstration of image representations using untrained deep neural networks.

Paper
Code

Adaptive Hypergraph Network for Trust Prediction

1 code implementation • 7 Feb 2024 • Rongwei Xu, Guanfeng Liu, Yan Wang, Xuyun Zhang, Kai Zheng, Xiaofang Zhou

In this paper, we propose an Adaptive Hypergraph Network for Trust Prediction (AHNTP), a novel approach that improves trust prediction accuracy by using higher-order correlations.

Contrastive Learning Decision Making

Paper
Code

A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction

1 code implementation • 7 Feb 2024 • Hailiang Li, Yan Huo, Yan Wang, Xu Yang, Miaohui Hao, Xiao Wang

As the modern CPU, GPU, and NPU chip design complexity and transistor counts keep increasing, and with the relentless shrinking of semiconductor technology nodes to nearly 1 nanometer, the placement and routing have gradually become the two most pivotal processes in modern very-large-scale-integrated (VLSI) circuit back-end design.

Avg SSIM

Paper
Code

AccidentBlip2: Accident Detection With Multi-View MotionBlip2

1 code implementation • 18 Apr 2024 • Yihua Shao, Hongyi Cai, Xinwei Long, Weiyi Lang, Zhe Wang, Haoran Wu, Yan Wang, Jiayi Yin, Yang Yang, Yisheng Lv, Zhen Lei

The inference capabilities of neural networks using cameras limit the accuracy of accident detection in complex transportation systems.

Language Modelling Large Language Model +2

Paper
Code

Semi-Supervised Multi-Organ Segmentation via Deep Multi-Planar Co-Training

no code implementations • 7 Apr 2018 • Yuyin Zhou, Yan Wang, Peng Tang, Song Bai, Wei Shen, Elliot K. Fishman, Alan L. Yuille

In multi-organ segmentation of abdominal CT scans, most existing fully supervised deep learning algorithms require lots of voxel-wise annotations, which are usually difficult, expensive, and slow to obtain.

Image Segmentation Organ Segmentation +2

Paper
Add Code

Automatic Article Commenting: the Task and Dataset

no code implementations • ACL 2018 • Lianhui Qin, Lemao Liu, Victoria Bi, Yan Wang, Xiaojiang Liu, Zhiting Hu, Hai Zhao, Shuming Shi

Comments of online articles provide extended views and improve user engagement.

Comment Generation

Paper
Add Code

Abdominal multi-organ segmentation with organ-attention networks and statistical fusion

no code implementations • 23 Apr 2018 • Yan Wang, Yuyin Zhou, Wei Shen, Seyoun Park, Elliot K. Fishman, Alan L. Yuille

To address these challenges, we introduce a novel framework for multi-organ segmentation by using organ-attention networks with reverse connections (OAN-RCs), which are applied to 2D views of the 3D CT volume and output estimates that are combined by statistical fusion exploiting structural similarity.

Organ Segmentation

Paper
Add Code

Training Multi-organ Segmentation Networks with Sample Selection by Relaxed Upper Confident Bound

no code implementations • 7 Apr 2018 • Yan Wang, Yuyin Zhou, Peng Tang, Wei Shen, Elliot K. Fishman, Alan L. Yuille

Based on the fact that very hard samples might have annotation errors, we propose a new sample selection policy, named Relaxed Upper Confident Bound (RUCB).

Image Segmentation Medical Image Segmentation +3

Paper
Add Code

Multi-Scale Spatially-Asymmetric Recalibration for Image Classification

no code implementations • ECCV 2018 • Yan Wang, Lingxi Xie, Siyuan Qiao, Ya zhang, Wenjun Zhang, Alan L. Yuille

Convolution is spatially-symmetric, i.e., the visual features are independent of their position in the image, which limits its ability to utilize contextual cues for visual recognition.

Classification General Classification +2

Paper
Add Code

Web-Scale Responsive Visual Search at Bing

no code implementations • 14 Feb 2018 • Houdong Hu, Yan Wang, Linjun Yang, Pavel Komlev, Li Huang, Xi Chen, Jiapei Huang, Ye Wu, Meenaz Merchant, Arun Sacheti

In this paper, we introduce a web-scale general visual search system deployed in Microsoft Bing.

Learning-To-Rank

Paper
Add Code

SORT: Second-Order Response Transform for Visual Recognition

no code implementations • ICCV 2017 • Yan Wang, Lingxi Xie, Chenxi Liu, Ya zhang, Wenjun Zhang, Alan Yuille

In this paper, we reveal the importance and benefits of introducing second-order operations into deep neural networks.

Paper
Add Code

Multi-stage Multi-recursive-input Fully Convolutional Networks for Neuronal Boundary Detection

no code implementations • ICCV 2017 • Wei Shen, Bin Wang, Yuan Jiang, Yan Wang, Alan Yuille

This design is biologically plausible, as it resembles how a human visual system compares different possible segmentation solutions to address the ambiguous boundary issue.

Boundary Detection Segmentation

Paper
Add Code

Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model

no code implementations • 19 Jul 2017 • M. Saquib Sarfraz, Arne Schumann, Yan Wang, Rainer Stiefelhagen

The visual cues hinting at attributes can be strongly localized, and inference of person attributes such as hair, backpack, shorts, etc., is highly dependent on the acquired view of the pedestrian.

Attribute Multi-Label Image Classification +2

Paper
Add Code

Deep Collaborative Learning for Visual Recognition

no code implementations • 3 Mar 2017 • Yan Wang, Lingxi Xie, Ya zhang, Wenjun Zhang, Alan Yuille

We formulate the function of a convolutional layer as learning a large visual vocabulary, and propose an alternative way, namely Deep Collaborative Learning (DCL), to reduce the computational complexity.

General Classification Image Classification

Paper
Add Code

EmotioNet Challenge: Recognition of facial expressions of emotion in the wild

no code implementations • 3 Mar 2017 • C. Fabian Benitez-Quiroz, Ramprakash Srinivasan, Qianli Feng, Yan Wang, Aleix M. Martinez

The second track tested the algorithms' ability to recognize emotion categories in images of facial expressions.

Paper
Add Code

e-Distance Weighted Support Vector Regression

no code implementations • 21 Jul 2016 • Yan Wang, Ge Ou, Wei Pang, Lan Huang, George Macleod Coghill

We propose a novel support vector regression approach called e-Distance Weighted Support Vector Regression (e-DWSVR). e-DWSVR specifically addresses two challenging issues in support vector regression: first, the processing of noisy data; second, how to deal with the situation when the distribution of boundary data is different from that of the overall data.

regression

Paper
Add Code

A Simple, Fast and Highly-Accurate Algorithm to Recover 3D Shape from 2D Landmarks on a Single Image

no code implementations • 28 Sep 2016 • Ruiqi Zhao, Yan Wang, Aleix Martinez

Three-dimensional shape reconstruction of 2D landmark points on a single image is a hallmark of human vision, but is a task that has proven difficult for computer vision algorithms.

3D Face Alignment 3D Shape Reconstruction +2

Paper
Add Code

Model-Driven Feed-Forward Prediction for Manipulation of Deformable Objects

no code implementations • 15 Jul 2016 • Yinxiao Li, Yan Wang, Yonghao Yue, Danfei Xu, Michael Case, Shih-Fu Chang, Eitan Grinspun, Peter Allen

A fully featured 3D model of the garment is constructed in real-time and volumetric features are then used to obtain the most similar model in the database to predict the object category and pose.

Object Pose Estimation +1

Paper
Add Code

Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs

no code implementations • CVPR 2016 • Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai

Object skeleton is a useful cue for object detection, complementary to the object contour, as it provides a structural representation to describe the relationship among object parts.

Object object-detection +1

Paper
Add Code

Learning to Rank Binary Codes

no code implementations • 21 Oct 2014 • Jie Feng, Wei Liu, Yan Wang

Binary codes have been widely used in vision problems as a compact feature representation to achieve both space and time advantages.

Binarization Image Retrieval +2

Paper
Add Code

Deep Person Re-identification for Probabilistic Data Association in Multiple Pedestrian Tracking

no code implementations • 19 Oct 2018 • Brian H. Wang, Yan Wang, Kilian Q. Weinberger, Mark Campbell

We present a data association method for vision-based multiple pedestrian tracking, using deep convolutional features to distinguish between different people based on their appearances.

Person Re-Identification Translation

Paper
Add Code

A two-stage hybrid model by using artificial neural networks as feature construction algorithms

no code implementations • 6 Dec 2018 • Yan Wang, Xuelei Sherry Ni, Brian Stone

The hybrid model uses a very simple neural network structure as the new feature construction tool in the first stage, then the newly created features are used as the additional input variables in logistic regression in the second stage.

Paper
Add Code

Translating a Math Word Problem to a Expression Tree

no code implementations • EMNLP 2018 • Lei Wang, Yan Wang, Deng Cai, Dongxiang Zhang, Xiaojiang Liu

Moreover, we analyze the performance of three popular SEQ2SEQ models on the math word problem solving.

Machine Translation Math +2

Paper
Add Code

Deep Neural Solver for Math Word Problems

no code implementations • EMNLP 2017 • Yan Wang, Xiaojiang Liu, Shuming Shi

This paper presents a deep neural solver to automatically solve math word problems.

Ranked #4 on Math Word Problem Solving on ALG514

Feature Engineering Machine Translation +4

Paper
Add Code

Generative Adversarial Learning Towards Fast Weakly Supervised Detection

no code implementations • CVPR 2018 • Yunhan Shen, Rongrong Ji, Shengchuan Zhang, WangMeng Zuo, Yan Wang

Without the need of annotating bounding boxes, the existing methods usually follow a two/multi-stage pipeline with an online compulsive stage to extract object proposals, which is an order of magnitude slower than fast fully supervised object detectors such as SSD [31] and YOLO [34].

Object object-detection +1

Paper
Add Code

An Automatic Interaction Detection Hybrid Model for Bankcard Response Classification

no code implementations • 2 Jan 2019 • Yan Wang, Xuelei Sherry Ni, Brian Stone

In the first stage of the hybrid model, CHAID analysis is used to detect the possibly potential variable interactions.

Classification General Classification +1

Paper
Add Code

A XGBoost risk model via feature selection and Bayesian hyper-parameter optimization

no code implementations • 24 Jan 2019 • Yan Wang, Xuelei Sherry Ni

TPE optimization shows a superiority over RS since it results in a significantly higher accuracy and a marginally higher AUC, recall and F1 score.

Clustering Feature Importance +2

Paper
Add Code

Label Propagation from ImageNet to 3D Point Clouds

no code implementations • CVPR 2013 • Yan Wang, Rongrong Ji, Shih-Fu Chang

Our approach shows further major gains in accuracy when the training data from the target scenes is used, outperforming state-of-the-art approaches with far better efficiency.

Paper
Add Code

DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection

no code implementations • CVPR 2015 • Wei Shen, Xinggang Wang, Yan Wang, Xiang Bai, Zhijiang Zhang

Contour detection serves as the basis of a variety of computer vision tasks such as image segmentation and object recognition.

Contour Detection Image Segmentation +3

Paper
Add Code

Recognition of Action Units in the Wild With Deep Nets and a New Global-Local Loss

no code implementations • ICCV 2017 • C. Fabian Benitez-Quiroz, Yan Wang, Aleix M. Martinez

Most previous algorithms for the recognition of Action Units (AUs) were trained on a small number of sample images.

Paper
Add Code

Risk Prediction of Peer-to-Peer Lending Market by a LSTM Model with Macroeconomic Factor

no code implementations • 13 Feb 2019 • Yan Wang, Xuelei Sherry Ni

Our study can broaden the applications of the LSTM algorithm by using it on the sequential P2P data and guide the investors in making investment strategies.

Time Series Analysis

Paper
Add Code

Predicting class-imbalanced business risk using resampling, regularization, and model ensembling algorithms

no code implementations • 13 Mar 2019 • Yan Wang, Xuelei Sherry Ni

We aim at developing and improving the imbalanced business risk modeling via jointly using proper evaluation criteria, resampling, cross-validation, classifier regularization, and ensembling techniques.

Paper
Add Code

Using Machine Learning and Natural Language Processing to Review and Classify the Medical Literature on Cancer Susceptibility Genes

no code implementations • 24 Apr 2019 • Yujia Bao, Zhengyi Deng, Yan Wang, Heeyoon Kim, Victor Diego Armengol, Francisco Acevedo, Nofal Ouardaoui, Cathy Wang, Giovanni Parmigiani, Regina Barzilay, Danielle Braun, Kevin S. Hughes

We developed and evaluated two machine learning models to classify abstracts as relevant to the penetrance (risk of cancer for germline mutation carriers) or prevalence of germline genetic mutations.

Classification General Classification

Paper
Add Code

Multi-Scale Attentional Network for Multi-Focal Segmentation of Active Bleed after Pelvic Fractures

no code implementations • 23 Jun 2019 • Yuyin Zhou, David Dreizin, Yingwei Li, Zhishuai Zhang, Yan Wang, Alan Yuille

Trauma is the worldwide leading cause of death and disability in those younger than 45 years, and pelvic fractures are a major source of morbidity and mortality.

Segmentation

Paper
Add Code

Deep Differentiable Random Forests for Age Estimation

no code implementations • 23 Jul 2019 • Wei Shen, Yilu Guo, Yan Wang, Kai Zhao, Bo Wang, Alan Yuille

Both of them connect split nodes to the top layer of convolutional neural networks (CNNs) and deal with inhomogeneous data by jointly learning input-dependent data partitions at the split nodes and age distributions at the leaf nodes.

Age Estimation regression

Paper
Add Code

A systematic review of fuzzing based on machine learning techniques

no code implementations • 4 Aug 2019 • Yan Wang, Peng Jia, Luping Liu, Jiayong Liu

Next, this paper assesses the performance of the machine learning models based on the frequently used evaluation metrics.

BIG-bench Machine Learning

Paper
Add Code

Semi-Supervised Adversarial Monocular Depth Estimation

no code implementations • 6 Aug 2019 • Rongrong Ji, Ke Li, Yan Wang, Xiaoshuai Sun, Feng Guo, Xiaowei Guo, Yongjian Wu, Feiyue Huang, Jiebo Luo

In this paper, we address the problem of monocular depth estimation when only a limited number of training image-depth pairs are available.

Monocular Depth Estimation

Paper
Add Code

Hyper-Pairing Network for Multi-Phase Pancreatic Ductal Adenocarcinoma Segmentation

no code implementations • 3 Sep 2019 • Yuyin Zhou, Yingwei Li, Zhishuai Zhang, Yan Wang, Angtian Wang, Elliot Fishman, Alan Yuille, Seyoun Park

Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal cancers with an overall five-year survival rate of 8%.

Paper
Add Code

Motion Planning through Demonstration to Deal with Complex Motions in Assembly Process

no code implementations • 4 Oct 2019 • Yan Wang, Kensuke Harada, Weiwei Wan

Complex and skillful motions in actual assembly process are challenging for the robot to generate with existing motion planning approaches, because some key poses during the human assembly can be too skillful for the robot to realize automatically.

Motion Planning

Paper
Add Code

Hadamard Codebook Based Deep Hashing

no code implementations • 21 Oct 2019 • Shen Chen, Liujuan Cao, Mingbao Lin, Yan Wang, Xiaoshuai Sun, Chenglin Wu, Jingfei Qiu, Rongrong Ji

Specifically, we utilize an off-the-shelf algorithm to generate a binary Hadamard codebook to satisfy the requirement of bit independence and bit balance, which subsequently serves as the desired outputs of the hash functions learning.

Deep Hashing Image Retrieval

Paper
Add Code

Metric Classification Network in Actual Face Recognition Scene

no code implementations • 25 Oct 2019 • Jian Li, Yan Wang, Xiubao Zhang, Weihong Deng, Haifeng Shen

In this paper, we train a validation classifier to normalize the decision threshold, which means that the result can be obtained directly without replacing the threshold.

Classification Face Recognition +2

Paper
Add Code

Applications of Generative Adversarial Models in Visual Search Reformulation

no code implementations • 28 Oct 2019 • Kyle Xiao, Houdong Hu, Yan Wang

Query reformulation is the process by which an input search query is refined by the user to match documents outside the original top-n results.

Paper
Add Code

Improving Open-Domain Dialogue Systems via Multi-Turn Incomplete Utterance Restoration

no code implementations • IJCNLP 2019 • Zhufeng Pan, Kun Bai, Yan Wang, Lianqiang Zhou, Xiaojiang Liu

To facilitate the study of incomplete utterance restoration for open-domain dialogue systems, a large-scale multi-turn dataset Restoration-200K is collected and manually labeled with the explicit relation between an utterance and its context.

Paper
Add Code

Retrieval-guided Dialogue Response Generation via a Matching-to-Generation Framework

no code implementations • IJCNLP 2019 • Deng Cai, Yan Wang, Wei Bi, Zhaopeng Tu, Xiaojiang Liu, Shuming Shi

End-to-end sequence generation is a popular technique for developing open-domain dialogue systems, though such systems suffer from the safe response problem.

Response Generation Retrieval

Paper
Add Code

Variational Structured Semantic Inference for Diverse Image Captioning

no code implementations • NeurIPS 2019 • Fuhai Chen, Rongrong Ji, Jiayi Ji, Xiaoshuai Sun, Baochang Zhang, Xuri Ge, Yongjian Wu, Feiyue Huang, Yan Wang

To model these two inherent diversities in image captioning, we propose a Variational Structured Semantic Inferring model (termed VSSI-cap) executed in a novel structured encoder-inferer-decoder schema.

Decoder Image Captioning

Paper
Add Code

Adaptive Portfolio by Solving Multi-armed Bandit via Thompson Sampling

no code implementations • 13 Nov 2019 • Mengying Zhu, Xiaolin Zheng, Yan Wang, Yuyuan Li, Qianqiao Liang

Also, by constructing multiple strategic arms, we can obtain the optimal investment portfolio to adapt to different investment periods.

Decision Making Management +1

Paper
Add Code

Deep Distance Transform for Tubular Structure Segmentation in CT Scans

no code implementations • CVPR 2020 • Yan Wang, Xu Wei, Fengze Liu, Jieneng Chen, Yuyin Zhou, Wei Shen, Elliot K. Fishman, Alan L. Yuille

Tubular structure segmentation in medical images, e.g., segmenting vessels in CT scans, serves as a vital step in the use of computers to aid in screening early stages of related diseases.

Segmentation

Paper
Add Code

Sequential Recommender Systems: Challenges, Progress and Prospects

no code implementations • 28 Dec 2019 • Shoujin Wang, Liang Hu, Yan Wang, Longbing Cao, Quan Z. Sheng, Mehmet Orgun

The emerging topic of sequential recommender systems (SRSs) has attracted increasing attention in recent years. Unlike conventional recommender systems, including collaborative filtering and content-based filtering, SRSs try to understand and model sequential user behaviors, the interactions between users and items, and the evolution of users' preferences and item popularity over time.

Collaborative Filtering Recommendation Systems

Paper
Add Code

Proposal Learning for Semi-Supervised Object Detection

no code implementations • 15 Jan 2020 • Peng Tang, Chetan Ramaiah, Yan Wang, Ran Xu, Caiming Xiong

two-stage object detectors) by training on both labeled and unlabeled data.

Object object-detection +2

Paper
Add Code

The World is Not Binary: Learning to Rank with Grayscale Data for Dialogue Response Selection

no code implementations • EMNLP 2020 • Zibo Lin, Deng Cai, Yan Wang, Xiaojiang Liu, Hai-Tao Zheng, Shuming Shi

Despite that response selection is naturally a learning-to-rank problem, most prior works take a point-wise view and train binary classifiers for this task: each response candidate is labeled either relevant (one) or irrelevant (zero).

Ranked #10 on Conversational Response Selection on E-commerce

Conversational Response Selection Learning-To-Rank +2

Paper
Add Code

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

no code implementations • 5 Apr 2020 • Yixuan Su, Yan Wang, Simon Baker, Deng Cai, Xiaojiang Liu, Anna Korhonen, Nigel Collier

A stylistic response generator then takes the prototype and the desired language style as model input to obtain a high-quality and stylistic response.

Dialogue Generation Information Retrieval +1

Paper
Add Code

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

no code implementations • 5 Apr 2020 • Yixuan Su, Deng Cai, Yan Wang, Simon Baker, Anna Korhonen, Nigel Collier, Xiaojiang Liu

To enable a better balance between the content quality and the style, we introduce a new training strategy, known as Information-Guided Reinforcement Learning (IG-RL).

Dialogue Generation reinforcement-learning +2

Paper
Add Code

Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation

no code implementations • ACL 2020 • Haoyu Song, Yan Wang, Wei-Nan Zhang, Xiaojiang Liu, Ting Liu

Maintaining a consistent personality in conversations is quite natural for human beings, but is still a non-trivial task for machines.

Dialogue Generation

Paper
Add Code

Graph Learning Approaches to Recommender Systems: A Review

no code implementations • 22 Apr 2020 • Shoujin Wang, Liang Hu, Yan Wang, Xiangnan He, Quan Z. Sheng, Mehmet Orgun, Longbing Cao, Nan Wang, Francesco Ricci, Philip S. Yu

Recent years have witnessed the fast development of the emerging topic of Graph Learning based Recommender Systems (GLRS).

Collaborative Filtering Graph Learning +1

Paper
Add Code

A Dual-Dimer Method for Training Physics-Constrained Neural Networks with Minimax Architecture

no code implementations • 1 May 2020 • Dehao Liu, Yan Wang

Data sparsity is a common issue to train machine learning tools such as neural networks for engineering and scientific applications, where experiments and simulations are expensive.

Paper
Add Code

Beyond CNNs: Exploiting Further Inherent Symmetries in Medical Images for Segmentation

no code implementations • 8 May 2020 • Shuchao Pang, Anan Du, Mehmet A. Orgun, Yan Wang, Quanzheng Sheng, Shoujin Wang, Xiaoshui Huang, Zhemei Yu

To mitigate this shortcoming, we propose a novel group equivariant segmentation framework by encoding those inherent symmetries for learning more precise representations.

Segmentation Tumor Segmentation

Paper
Add Code

Domain Adaptive Relational Reasoning for 3D Multi-Organ Segmentation

no code implementations • 18 May 2020 • Shuhao Fu, Yongyi Lu, Yan Wang, Yuyin Zhou, Wei Shen, Elliot Fishman, Alan Yuille

In this paper, we present a novel unsupervised domain adaptation (UDA) method, named Domain Adaptive Relational Reasoning (DARR), to generalize 3D multi-organ segmentation models to medical data collected from different scanners and/or protocols (domains).

Organ Segmentation Relational Reasoning +3

Paper
Add Code

Recognizing Chinese Judicial Named Entity using BiLSTM-CRF

no code implementations • 31 May 2020 • Pin Tang, Pinli Yang, Yuang Shi, Yi Zhou, Feng Lin, Yan Wang

Named entity recognition (NER) plays an essential role in natural language processing systems.

Information Retrieval named-entity-recognition +4

Paper
Add Code

The 'Letter' Distribution in the Chinese Language

no code implementations • 26 May 2020 • Qinghua Chen, Yan Wang, Mengmeng Wang, Xiaomeng Li

In addition, we collected Chinese literature corpora for different historical periods from the Tang Dynasty to the present, and we dismantled the Chinese written language into three kinds of basic particles: characters, strokes and constructive parts.

Paper
Add Code

DSU-net: Dense SegU-net for automatic head-and-neck tumor segmentation in MR images

no code implementations • 11 Jun 2020 • Pin Tang, Chen Zu, Mei Hong, Rui Yan, Xingchen Peng, Jianghong Xiao, Xi Wu, Jiliu Zhou, Luping Zhou, Yan Wang

In this paper, we propose a Dense SegU-net (DSU-net) framework for automatic NPC segmentation in MRI.

Decoder Segmentation +1

Paper
Add Code

srMO-BO-3GP: A sequential regularized multi-objective constrained Bayesian optimization for design applications

no code implementations • 7 Jul 2020 • Anh Tran, Mike Eldred, Scott McCann, Yan Wang

Finally, we couple the third GP along with the classical BO framework to promote the richness and diversity of the Pareto frontier by the exploitation and exploration acquisition function.

Bayesian Optimization Gaussian Processes

Paper
Add Code

Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport

no code implementations • 14 Aug 2020 • Zelong Yang, Zhufeng Pan, Yan Wang, Deng Cai, Xiaojiang Liu, Shuming Shi, Shao-Lun Huang

With the rapid prevalence and explosive development of MOBA esports (Multiplayer Online Battle Arena electronic sports), much research effort has been devoted to automatically predicting game results (win predictions).

Attribute

Paper
Add Code

Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing

no code implementations • 25 Aug 2020 • Wei Shen, Xiaonan He, Chuheng Zhang, Qiang Ni, Wanchun Dou, Yan Wang

Therefore, it is crucial to design a participant selection algorithm that applies to different MCS systems to achieve multiple goals.

Combinatorial Optimization Fairness +2

Paper
Add Code

Enabling Deep Residual Networks for Weakly Supervised Object Detection

no code implementations • ECCV 2020 • Yunhang Shen, Rongrong Ji, Yan Wang, Zhiwei Chen, Feng Zheng, Feiyue Huang, Yunsheng Wu

Weakly supervised object detection (WSOD) has attracted extensive research attention due to its great flexibility of exploiting large-scale image-level annotation for detector training.

Object object-detection +1

Paper
Add Code

Developing and Improving Risk Models using Machine-learning Based Algorithms

no code implementations • 9 Sep 2020 • Yan Wang, Xuelei Sherry Ni

The objective of this study is to develop a good risk model for classifying business delinquency by simultaneously exploring several machine learning based methods including regularization, hyper-parameter optimization, and model ensembling algorithms.

BIG-bench Machine Learning

Paper
Add Code

Improving Investment Suggestions for Peer-to-Peer (P2P) Lending via Integrating Credit Scoring into Profit Scoring

no code implementations • 9 Sep 2020 • Yan Wang, Xuelei Sherry Ni

The studies have mainly focused on two categories to guide the lenders' investments: one aims at minimizing the risk of investment (i.e., the credit scoring perspective) while the other aims at maximizing the profit (i.e., the profit scoring perspective).

Paper
Add Code

A Deep Framework for Cross-Domain and Cross-System Recommendations

no code implementations • 14 Sep 2020 • Feng Zhu, Yan Wang, Chaochao Chen, Guanfeng Liu, Mehmet Orgun, Jia Wu

Therefore, finding an accurate mapping of the latent factors across domains or systems is crucial to enhancing recommendation accuracy.

Recommendation Systems

Paper
Add Code

Double-Wing Mixture of Experts for Streaming Recommendations

no code implementations • 14 Sep 2020 • Yan Zhao, Shoujin Wang, Yan Wang, Hongwei Liu, Weizhe Zhang

In VRS-DWMoE, we first devise variational and reservoir-enhanced sampling to wisely complement new data with historical data, and thus address the user preference drift issue while capturing long-term user preferences.

Ensemble Learning Recommendation Systems

Paper
Add Code

Stratified and Time-aware Sampling based Adaptive Ensemble Learning for Streaming Recommendations

no code implementations • 15 Sep 2020 • Yan Zhao, Shoujin Wang, Yan Wang, Hongwei Liu

To address these problems, we propose a Stratified and Time-aware Sampling based Adaptive Ensemble Learning framework, called STS-AEL, to improve the accuracy of streaming recommendations.

Ensemble Learning Recommendation Systems +1

Paper
Add Code

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

no code implementations • 19 Sep 2020 • Xin Li, Piji Li, Yan Wang, Xiaojiang Liu, Wai Lam

Most of the existing works for dialogue generation are data-driven models trained directly on corpora crawled from websites.

Contrastive Learning Dialogue Generation +1

Paper
Add Code

AC-VAE: Learning Semantic Representation with VAE for Adaptive Clustering

no code implementations • 1 Jan 2021 • Xingyu Xie, Minjuan Zhu, Yan Wang, Lei Zhang

Experimental evaluations show that the proposed method outperforms state-of-the-art representation learning methods in terms of neighbor clustering accuracy.

Classification Clustering +3

Paper
Add Code

ASMFS: Adaptive-Similarity-based Multi-modality Feature Selection for Classification of Alzheimer's Disease

no code implementations • 16 Oct 2020 • Yuang Shi, Chen Zu, Mei Hong, Luping Zhou, Lei Wang, Xi Wu, Jiliu Zhou, Daoqiang Zhang, Yan Wang

With the increasing amounts of high-dimensional heterogeneous data to be processed, multi-modality feature selection has become an important research direction in medical image analysis.

feature selection General Classification

Paper
Add Code

Organ size increases with obesity and correlates with cancer risk

no code implementations • 27 May 2020 • Haley Grant, Yifan Zhang, Lu Li, Yan Wang, Satomi Kawamoto, Sophie Pénisson, Daniel F. Fouladi, Shahab Shayesteh, Alejandra Blanco, Saeed Ghandili, Eva Zinreich, Jefferson S. Graves, Seyoun Park, Scott Kern, Jody Hooper, Alan L. Yuille, Elliot K Fishman, Linda Chu, Cristian Tomasetti

Obesity significantly increases cancer risk in various organs.

Paper
Add Code

Complementary probe of dark matter blind spots by lepton colliders and gravitational waves

no code implementations • 7 Dec 2020 • Yan Wang, Chong Sheng Li, Fa Peng Huang

We study how to unravel the dark matter blind spots by phase transition gravitational waves in synergy with collider signatures at electroweak one-loop level taking the inert doublet model as an example.

High Energy Physics - Phenomenology

Paper
Add Code

A multi-objective optimization framework for on-line ridesharing systems

no code implementations • 7 Dec 2020 • Hamed Javidi, Dan Simon, Ling Zhu, Yan Wang

The ultimate goal of ridesharing systems is to match travelers who do not have a vehicle with those travelers who want to share their vehicle.

Paper
Add Code

Predicting Events in MOBA Games: Prediction, Attribution, and Evaluation

no code implementations • 17 Dec 2020 • Zelong Yang, Yan Wang, Piji Li, Shaobin Lin, Shuming Shi, Shao-Lun Huang, Wei Bi

The multiplayer online battle arena (MOBA) games have become increasingly popular in recent years.

Paper
Add Code

Towards Scalable and Privacy-Preserving Deep Neural Network via Algorithmic-Cryptographic Co-design

no code implementations • 17 Dec 2020 • Jun Zhou, Longfei Zheng, Chaochao Chen, Yan Wang, Xiaolin Zheng, Bingzhe Wu, Cen Chen, Li Wang, Jianwei Yin

In this paper, we propose SPNN - a Scalable and Privacy-preserving deep Neural Network learning framework, from algorithmic-cryptographic co-perspective.

Privacy Preserving

Paper
Add Code

A Two Sub-problem Decomposition for the Optimal Design of Filterless Optical Networks

no code implementations • 4 Jan 2021 • Brigitte Jaumard, Yan Wang

The first type of subproblem relies on the generation of filterless subnetworks while the second one takes care of their wavelength assignment.

Problem Decomposition Networking and Internet Architecture

Paper
Add Code

Contextualized Emotion Recognition in Conversation as Sequence Tagging

no code implementations • 1 Jul 2020 • Yan Wang, Jiayu Zhang, Jun Ma, Shaojun Wang, Jing Xiao

Emotion recognition in conversation (ERC) is an important topic for developing empathetic machines in a variety of areas including social opinion mining, health-care and so on.

Ranked #2 on Emotion Recognition in Conversation on DailyDialog

Emotion Classification Emotion Recognition in Conversation +1

Paper
Add Code

Spreading dynamics of a 2SIH2R, rumor spreading model in the homogeneous network

no code implementations • 26 Jan 2021 • Yan Wang, Feng Qing, Jian-Ping Chai, Ye-Peng Ni

In the era of the rapid development of the Internet, the threshold for information spreading has become lower.

Social and Information Networks Probability

Paper
Add Code
