Search Results for author: Wei Shen

Found 125 papers, 50 papers with code

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

1 code implementation15 Feb 2024 Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Then we construct a Gaussian repair model based on diffusion models to supplement the omitted object information, where Gaussians are further refined.

Neural Rendering Object

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

4 code implementations9 Jul 2018 Peng Tang, Xinggang Wang, Song Bai, Wei Shen, Xiang Bai, Wenyu Liu, Alan Yuille

The iterative instance classifier refinement is implemented online using multiple streams in convolutional neural networks, where the first is an MIL network and the others are for instance classifier refinement supervised by the preceding one.

Multiple Instance Learning Object +3

Intriguing Findings of Frequency Selection for Image Deblurring

2 code implementations23 Nov 2021 Xintian Mao, Yiming Liu, Fengze Liu, Qingli Li, Wei Shen, Yan Wang

Blur was naturally analyzed in the frequency domain, by estimating the latent sharp image and the blur kernel given a blurry image.

Deblurring Image Deblurring +1

Robust Face Detection via Learning Small Faces on Hard Images

1 code implementation28 Nov 2018 Zhishuai Zhang, Wei Shen, Siyuan Qiao, Yan Wang, Bo wang, Alan Yuille

In this paper, we propose that the robustness of a face detector against hard faces can be improved by learning small faces on hard images.

Face Detection

Shape-Texture Debiased Neural Network Training

1 code implementation ICLR 2021 Yingwei Li, Qihang Yu, Mingxing Tan, Jieru Mei, Peng Tang, Wei Shen, Alan Yuille, Cihang Xie

To prevent models from exclusively attending on a single cue in representation learning, we augment training data with images with conflicting shape and texture information (eg, an image of chimpanzee shape but with lemon texture) and, most importantly, provide the corresponding supervisions from shape and texture simultaneously.

Adversarial Robustness Data Augmentation +2

Self-supervised Implicit Glyph Attention for Text Recognition

1 code implementation CVPR 2023 Tongkun Guan, Chaochen Gu, Jingzheng Tu, Xue Yang, Qi Feng, Yudi Zhao, Xiaokang Yang, Wei Shen

Supervised attention can alleviate the above issue, but it is character category-specific, which requires extra laborious character-level bounding box annotations and would be memory-intensive when handling languages with larger character categories.

Scene Text Recognition Text Segmentation

SoccerNet 2023 Challenges Results

2 code implementations12 Sep 2023 Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, Chen Chen, Fabian Deuser, Feng Yan, Fufu Yu, Gal Shitrit, Guanshuo Wang, Gyusik Choi, Hankyul Kim, Hao Guo, Hasby Fahrudin, Hidenari Koguchi, Håkan Ardö, Ibrahim Salah, Ido Yerushalmy, Iftikar Muhammad, Ikuma Uchida, Ishay Be'ery, Jaonary Rabarisoa, Jeongae Lee, Jiajun Fu, Jianqin Yin, Jinghang Xu, Jongho Nang, Julien Denize, Junjie Li, Junpei Zhang, Juntae Kim, Kamil Synowiec, Kenji Kobayashi, Kexin Zhang, Konrad Habel, Kota Nakajima, Licheng Jiao, Lin Ma, Lizhi Wang, Luping Wang, Menglong Li, Mengying Zhou, Mohamed Nasr, Mohamed Abdelwahed, Mykola Liashuha, Nikolay Falaleev, Norbert Oswald, Qiong Jia, Quoc-Cuong Pham, Ran Song, Romain Hérault, Rui Peng, Ruilong Chen, Ruixuan Liu, Ruslan Baikulov, Ryuto Fukushima, Sergio Escalera, Seungcheon Lee, Shimin Chen, Shouhong Ding, Taiga Someya, Thomas B. Moeslund, Tianjiao Li, Wei Shen, Wei zhang, Wei Li, Wei Dai, Weixin Luo, Wending Zhao, Wenjie Zhang, Xinquan Yang, Yanbiao Ma, Yeeun Joo, Yingsen Zeng, Yiyang Gan, Yongqiang Zhu, Yujie Zhong, Zheng Ruan, Zhiheng Li, Zhijian Huang, Ziyu Meng

More information on the tasks, challenges, and leaderboards are available on https://www. soccer-net. org.

Action Spotting Camera Calibration +3

DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images

1 code implementation13 Sep 2016 Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Xiang Bai, Alan Yuille

By observing the relationship between the receptive field sizes of the different layers in the network and the skeleton scales they can capture, we introduce two scale-associated side outputs to each stage of the network.

Multi-Task Learning Object +3

Deep Co-Training for Semi-Supervised Image Recognition

1 code implementation ECCV 2018 Siyuan Qiao, Wei Shen, Zhishuai Zhang, Bo wang, Alan Yuille

We present Deep Co-Training, a deep learning based method inspired by the Co-Training framework.

Synthesize then Compare: Detecting Failures and Anomalies for Semantic Segmentation

1 code implementation ECCV 2020 Yingda Xia, Yi Zhang, Fengze Liu, Wei Shen, Alan Yuille

The ability to detect failures and anomalies are fundamental requirements for building reliable systems for computer vision applications, especially safety-critical applications of semantic segmentation, such as autonomous driving and medical image analysis.

Ranked #8 on Anomaly Detection on Road Anomaly (using extra training data)

Anomaly Detection Autonomous Driving +3

SpecTr: Spectral Transformer for Hyperspectral Pathology Image Segmentation

1 code implementation5 Mar 2021 Boxiang Yun, Yan Wang, Jieneng Chen, Huiyu Wang, Wei Shen, Qingli Li

Hyperspectral imaging (HSI) unlocks the huge potential to a wide variety of applications relied on high-precision pathology image segmentation, such as computational pathology and precision medicine.

Image Segmentation Segmentation +1

Batch Normalization with Enhanced Linear Transformation

1 code implementation28 Nov 2020 Yuhui Xu, Lingxi Xie, Cihang Xie, Jieru Mei, Siyuan Qiao, Wei Shen, Hongkai Xiong, Alan Yuille

Batch normalization (BN) is a fundamental unit in modern deep networks, in which a linear transformation module was designed for improving BN's flexibility of fitting complex data distributions.

LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

1 code implementation15 Dec 2023 Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, ShiLiang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks.

Language Modelling Multi-Task Learning +1

Rethinking Normalization and Elimination Singularity in Neural Networks

1 code implementation21 Nov 2019 Siyuan Qiao, Huiyu Wang, Chenxi Liu, Wei Shen, Alan Yuille

To address this issue, we propose BatchChannel Normalization (BCN), which uses batch knowledge to avoid the elimination singularities in the training of channel-normalized models.

Image Classification Instance Segmentation +4

Skeleton2Humanoid: Animating Simulated Characters for Physically-plausible Motion In-betweening

1 code implementation9 Oct 2022 Yunhao Li, Zhenbo Yu, Yucheng Zhu, Bingbing Ni, Guangtao Zhai, Wei Shen

Stage I introduces a test time adaptation strategy, which improves the physical plausibility of synthesized human skeleton motions by optimizing skeleton joint locations.

Motion Synthesis Reinforcement Learning (RL) +1

ChildPredictor: A Child Face Prediction Framework with Disentangled Learning

1 code implementation21 Apr 2022 Yuzhi Zhao, Lai-Man Po, Xuehui Wang, Qiong Yan, Wei Shen, Yujia Zhang, Wei Liu, Chun-Kit Wong, Chiu-Sing Pang, Weifeng Ou, Wing-Yin Yu, Buhua Liu

On this basis, we formulate predictions as a mapping from parents' genetic factors to children's genetic factors, and disentangle them from external and variety factors.

Age-Invariant Face Recognition Image-to-Image Translation +2

MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery

1 code implementation CVPR 2023 Duowen Chen, Yunhao Bai, Wei Shen, Qingli Li, Lequan Yu, Yan Wang

Our strategy encourages unlabeled images to learn organ semantics in relative locations from the labeled images (cross-branch) and enhances the learning ability for small organs (within-branch).

Anatomy Data Augmentation +4

Neural LerPlane Representations for Fast 4D Reconstruction of Deformable Tissues

2 code implementations31 May 2023 Chen Yang, Kailing Wang, Yuehao Wang, Xiaokang Yang, Wei Shen

Reconstructing deformable tissues from endoscopic stereo videos in robotic surgery is crucial for various clinical applications.

4D reconstruction

Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane

2 code implementations23 Dec 2023 Chen Yang, Kailing Wang, Yuehao Wang, Qi Dou, Xiaokang Yang, Wei Shen

Intraoperative imaging techniques for reconstructing deformable tissues in vivo are pivotal for advanced surgical systems.

Glance-and-Gaze Vision Transformer

1 code implementation NeurIPS 2021 Qihang Yu, Yingda Xia, Yutong Bai, Yongyi Lu, Alan Yuille, Wei Shen

It is motivated by the Glance and Gaze behavior of human beings when recognizing objects in natural scenes, with the ability to efficiently model both long-range dependencies and local context.

ContrastMask: Contrastive Learning to Segment Every Thing

1 code implementation CVPR 2022 Xuehui Wang, Kai Zhao, Ruixin Zhang, Shouhong Ding, Yan Wang, Wei Shen

In this framework, annotated masks of seen categories and pseudo masks of unseen categories serve as a prior for contrastive learning, where features from the mask regions (foreground) are pulled together, and are contrasted against those from the background, and vice versa.

Instance Segmentation Segmentation +1

Masked Autoencoders as Image Processors

1 code implementation30 Mar 2023 Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Long Teng, Jia Wang, Guangtao Zhai

Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks.

Deblurring Image Defocus Deblurring +2

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

1 code implementation8 Feb 2024 Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, wei he, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models.

GSM8K reinforcement-learning +1

Dual Attention Guided Gaze Target Detection in the Wild

1 code implementation CVPR 2021 Yi Fang, Jiapeng Tang, Wang Shen, Wei Shen, Xiao Gu, Li Song, Guangtao Zhai

In the third stage, we use the generated dual attention as guidance to perform two sub-tasks: (1) identifying whether the gaze target is inside or out of the image; (2) locating the target if inside.

Inductive Matrix Completion Using Graph Autoencoder

2 code implementations25 Aug 2021 Wei Shen, Chuheng Zhang, Yun Tian, Liang Zeng, Xiaonan He, Wanchun Dou, Xiaolong Xu

However, without node content (i. e., side information) for training, the user (or item) specific representation can not be learned in the inductive setting, that is, a model trained on one group of users (or items) cannot adapt to new users (or items).

Matrix Completion Recommendation Systems

Deeply Shape-guided Cascade for Instance Segmentation

1 code implementation CVPR 2021 Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen

The key to a successful cascade architecture for precise instance segmentation is to fully leverage the relationship between bounding box detection and mask segmentation across multiple stages.

Instance Segmentation Region Proposal +2

Unsupervised Domain Adaptation through Shape Modeling for Medical Image Segmentation

1 code implementation6 Jul 2022 Yuan YAO, Fengze Liu, Zongwei Zhou, Yan Wang, Wei Shen, Alan Yuille, Yongyi Lu

Previous methods proposed Variational Autoencoder (VAE) based models to learn the distribution of shape for a particular organ and used it to automatically evaluate the quality of a segmentation prediction by fitting it into the learned shape distribution.

Image Segmentation Pancreas Segmentation +3

Looking Here or There? Gaze Following in 360-Degree Images

1 code implementation ICCV 2021 Yunhao Li, Wei Shen, Zhongpai Gao, Yucheng Zhu, Guangtao Zhai, Guodong Guo

Specifically, the local region is obtained as a 2D cone-shaped field along the 2D projection of the sight line starting at the human subject's head position, and the distant region is obtained by searching along the sight line in 3D sphere space.

CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation

1 code implementation22 Mar 2022 Feng Wang, Huiyu Wang, Chen Wei, Alan Yuille, Wei Shen

Recent advances in self-supervised contrastive learning yield good image-level representation, which favors classification tasks but usually neglects pixel-level detailed information, leading to unsatisfactory transfer performance to dense prediction tasks such as semantic segmentation.

Contrastive Learning Representation Learning +2

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

1 code implementation21 Jan 2024 Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin

This reliance limits the applicability of RLHF and hinders the development of professional assistants tailored to diverse human preferences.

Saliency in Augmented Reality

1 code implementation18 Apr 2022 Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai

Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.

Saliency Prediction

Multi-View Clustering for Open Knowledge Base Canonicalization

2 code implementations22 Jun 2022 Wei Shen, Yang Yang, Yinan Liu

In this paper, we propose CMVC, a novel unsupervised framework that leverages these two views of knowledge jointly for canonicalizing OKBs without the need of manually annotated labels.

Clustering Open Information Extraction +1

Community Question Answering Entity Linking via Leveraging Auxiliary Data

2 code implementations24 May 2022 Yuhan Li, Wei Shen, Jianbo Gao, Yadong Wang

Community Question Answering (CQA) platforms contain plenty of CQA texts (i. e., questions and answers corresponding to the question) where named entities appear ubiquitously.

Community Question Answering Entity Linking

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets

1 code implementation5 Dec 2022 Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, TieYan Liu

There are two challenges for this setting: 1) The optimal trade-off between optimizing the RL signal and the behavior cloning (BC) signal changes on different states due to the variation of the action coverage induced by different behavior policies.

D4RL Offline RL +2

Learning Residual Images for Face Attribute Manipulation

1 code implementation CVPR 2017 Wei Shen, Rujie Liu

The transformation networks are responsible for the attribute manipulation and its dual operation and the discriminative network is used to distinguish the generated images from real images.

Attribute Generative Adversarial Network

Resisting Large Data Variations via Introspective Transformation Network

no code implementations16 May 2018 Yunhan Zhao, Ye Tian, Charless Fowlkes, Wei Shen, Alan Yuille

Experimental results verify that our approach significantly improves the ability of deep networks to resist large variations between training and testing data and achieves classification accuracy improvements on several benchmark datasets, including MNIST, affNIST, SVHN, CIFAR-10 and miniImageNet.

Data Augmentation Few-Shot Learning

Semi-Supervised Multi-Organ Segmentation via Deep Multi-Planar Co-Training

no code implementations7 Apr 2018 Yuyin Zhou, Yan Wang, Peng Tang, Song Bai, Wei Shen, Elliot K. Fishman, Alan L. Yuille

In multi-organ segmentation of abdominal CT scans, most existing fully supervised deep learning algorithms require lots of voxel-wise annotations, which are usually difficult, expensive, and slow to obtain.

Image Segmentation Organ Segmentation +2

Hi-Fi: Hierarchical Feature Integration for Skeleton Detection

no code implementations5 Jan 2018 Kai Zhao, Wei Shen, Shang-Hua Gao, Dandan Li, Ming-Ming Cheng

In natural images, the scales (thickness) of object skeletons may dramatically vary among objects and object parts, making object skeleton detection a challenging problem.

Object Object Skeleton Detection

Abdominal multi-organ segmentation with organ-attention networks and statistical fusion

no code implementations23 Apr 2018 Yan Wang, Yuyin Zhou, Wei Shen, Seyoun Park, Elliot K. Fishman, Alan L. Yuille

To address these challenges, we introduce a novel framework for multi-organ segmentation by using organ-attention networks with reverse connections (OAN-RCs) which are applied to 2D views, of the 3D CT volume, and output estimates which are combined by statistical fusion exploiting structural similarity.

Organ Segmentation

Single-Shot Object Detection with Enriched Semantics

no code implementations CVPR 2018 Zhishuai Zhang, Siyuan Qiao, Cihang Xie, Wei Shen, Bo wang, Alan L. Yuille

Our motivation is to enrich the semantics of object detection features within a typical deep detector, by a semantic segmentation branch and a global activation module.

Object object-detection +3

Training Multi-organ Segmentation Networks with Sample Selection by Relaxed Upper Confident Bound

no code implementations7 Apr 2018 Yan Wang, Yuyin Zhou, Peng Tang, Wei Shen, Elliot K. Fishman, Alan L. Yuille

Based on the fact that very hard samples might have annotation errors, we propose a new sample selection policy, named Relaxed Upper Confident Bound (RUCB).

Image Segmentation Medical Image Segmentation +3

Gradually Updated Neural Networks for Large-Scale Image Recognition

no code implementations ICML 2018 Siyuan Qiao, Zhishuai Zhang, Wei Shen, Bo wang, Alan Yuille

Our method is by introducing computation orderings to the channels within convolutional layers or blocks, based on which we gradually compute the outputs in a channel-wise manner.

Label Distribution Learning Forests

no code implementations NeurIPS 2017 Wei Shen, Kai Zhao, Yilu Guo, Alan Yuille

This paper presents label distribution learning forests (LDLFs) - a novel label distribution learning algorithm based on differentiable decision trees, which have several advantages: 1) Decision trees have the potential to model any general form of label distributions by a mixture of leaf node predictions.

Age Estimation Representation Learning

Multi-stage Multi-recursive-input Fully Convolutional Networks for Neuronal Boundary Detection

no code implementations ICCV 2017 Wei Shen, Bin Wang, Yuan Jiang, Yan Wang, Alan Yuille

This design is biologically-plausible, as it likes a human visual system to compare different possible segmentation solutions to address the ambiguous boundary issue.

Boundary Detection Segmentation

ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond

no code implementations ICCV 2017 Siyuan Qiao, Wei Shen, Weichao Qiu, Chenxi Liu, Alan Yuille

We argue that estimation of object scales in images is helpful for generating object proposals, especially for supermarket images where object scales are usually within a small range.

Object Object Proposal Generation

Shape Recognition by Bag of Skeleton-associated Contour Parts

no code implementations20 May 2016 Wei Shen, Yuan Jiang, Wenjing Gao, Dan Zeng, Xinggang Wang

Contour and skeleton are two complementary representations for shape recognition.

Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs

no code implementations CVPR 2016 Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai

Object skeleton is a useful cue for object detection, complementary to the object contour, as it provides a structural representation to describe the relationship among object parts.

Object object-detection +1

Tackling Early Sparse Gradients in Softmax Activation Using Leaky Squared Euclidean Distance

no code implementations27 Nov 2018 Wei Shen, Rujie Liu

However, we find that choosing squared Euclidean distance may cause distance explosion leading gradients to be extremely sparse in the early stage of back propagation.

One-Shot Learning

Generating Attention from Classifier Activations for Fine-grained Recognition

no code implementations27 Nov 2018 Wei Shen, Rujie Liu

Recent advances in fine-grained recognition utilize attention maps to localize objects of interest.

Semantic Segmentation

Learning to generate filters for convolutional neural networks

no code implementations ICLR 2018 Wei Shen, Rujie Liu

In this paper, we propose to generate sample-specific filters for convolutional layers in the forward pass.

Symmetry-Based Text Line Detection in Natural Scenes

no code implementations CVPR 2015 Zheng Zhang, Wei Shen, Cong Yao, Xiang Bai

Recently, a variety of real-world applications have triggered huge demand for techniques that can extract textual information from natural scenes.

Line Detection Scene Text Detection +1

Learning from Adversarial Features for Few-Shot Classification

no code implementations25 Mar 2019 Wei Shen, Ziqiang Shi, Jun Sun

Then we use the adversarial region attention to aggregate the feature maps to obtain the adversarial features.

Classification Few-Shot Learning +1

Stability and Optimization Error of Stochastic Gradient Descent for Pairwise Learning

no code implementations25 Apr 2019 Wei Shen, Zhenhuan Yang, Yiming Ying, Xiaoming Yuan

From this fundamental trade-off, we obtain lower bounds for the optimization error of SGD algorithms and the excess expected risk over a class of pairwise losses.

Generalization Bounds Metric Learning

Learning to Find Correlated Features by Maximizing Information Flow in Convolutional Neural Networks

no code implementations30 Jun 2019 Wei Shen, Fei Li, Rujie Liu

We argue that the discard of the correlated discriminative information is partially caused by the fact that the minimization of the classification loss doesn't ensure to learn the overall discriminative information but only the most discriminative information.

Classification General Classification +1

Deep Differentiable Random Forests for Age Estimation

no code implementations23 Jul 2019 Wei Shen, Yilu Guo, Yan Wang, Kai Zhao, Bo wang, Alan Yuille

Both of them connect split nodes to the top layer of convolutional neural networks (CNNs) and deal with inhomogeneous data by jointly learning input-dependent data partitions at the split nodes and age distributions at the leaf nodes.

Age Estimation regression

TDAPNet: Prototype Network with Recurrent Top-Down Attention for Robust Object Classification under Partial Occlusion

no code implementations9 Sep 2019 Mingqing Xiao, Adam Kortylewski, Ruihai Wu, Siyuan Qiao, Wei Shen, Alan Yuille

Despite deep convolutional neural networks' great success in object classification, it suffers from severe generalization performance drop under occlusion due to the inconsistency between training and testing data.

General Classification Object +1

Deep Distance Transform for Tubular Structure Segmentation in CT Scans

no code implementations CVPR 2020 Yan Wang, Xu Wei, Fengze Liu, Jieneng Chen, Yuyin Zhou, Wei Shen, Elliot K. Fishman, Alan L. Yuille

Tubular structure segmentation in medical images, e. g., segmenting vessels in CT scans, serves as a vital step in the use of computers to aid in screening early stages of related diseases.

Segmentation

Segmentation for Classification of Screening Pancreatic Neuroendocrine Tumors

no code implementations4 Apr 2020 Zhuotun Zhu, Yongyi Lu, Wei Shen, Elliot K. Fishman, Alan L. Yuille

This work presents comprehensive results to detect in the early stage the pancreatic neuroendocrine tumors (PNETs), a group of endocrine tumors arising in the pancreas, which are the second common type of pancreatic cancer, by checking the abdominal CT scans.

Classification General Classification +1

Domain Adaptive Relational Reasoning for 3D Multi-Organ Segmentation

no code implementations18 May 2020 Shuhao Fu, Yongyi Lu, Yan Wang, Yuyin Zhou, Wei Shen, Elliot Fishman, Alan Yuille

In this paper, we present a novel unsupervised domain adaptation (UDA) method, named Domain Adaptive Relational Reasoning (DARR), to generalize 3D multi-organ segmentation models to medical data collected from different scanners and/or protocols (domains).

Organ Segmentation Relational Reasoning +3

Towards Unsupervised Learning for Instrument Segmentation in Robotic Surgery with Cycle-Consistent Adversarial Networks

no code implementations9 Jul 2020 Daniil Pakhomov, Wei Shen, Nassir Navab

Surgical tool segmentation in endoscopic images is an important problem: it is a crucial step towards full instrument pose estimation and it is used for integration of pre- and intra-operative images into the endoscopic view.

Image Segmentation Image-to-Image Translation +3

Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing

no code implementations25 Aug 2020 Wei Shen, Xiaonan He, Chuheng Zhang, Qiang Ni, Wanchun Dou, Yan Wang

Therefore, it is crucial to design a participant selection algorithm that applies to different MCS systems to achieve multiple goals.

Combinatorial Optimization Fairness +2

CO2: Consistent Contrast for Unsupervised Visual Representation Learning

no code implementations ICLR 2021 Chen Wei, Huiyu Wang, Wei Shen, Alan Yuille

Regarding the similarity of the query crop to each crop from other images as "unlabeled", the consistency term takes the corresponding similarity of a positive crop as a pseudo label, and encourages consistency between these two similarities.

Contrastive Learning Image Classification +5

Volumetric Medical Image Segmentation: A 3D Deep Coarse-to-fine Framework and Its Adversarial Examples

no code implementations29 Oct 2020 Yingwei Li, Zhuotun Zhu, Yuyin Zhou, Yingda Xia, Wei Shen, Elliot K. Fishman, Alan L. Yuille

Although deep neural networks have been a dominant method for many 2D vision tasks, it is still challenging to apply them to 3D tasks, such as medical image segmentation, due to the limited amount of annotated 3D data and limited computational resources.

Image Segmentation Pancreas Segmentation +2

Hifi: Hierarchical feature integration for skeleton detection

no code implementations1 Jul 2018 Kai Zhao, Wei Shen, ShangHua Gao, Dandan Li, Ming-Ming Cheng

In natural images, the scales (thickness) of object skeletons may dramatically vary among objects and object parts.

Object Object Skeleton Detection

Learning Inductive Attention Guidance for Partially Supervised Pancreatic Ductal Adenocarcinoma Prediction

no code implementations31 May 2021 Yan Wang, Peng Tang, Yuyin Zhou, Wei Shen, Elliot K. Fishman, Alan L. Yuille

We instantiate both the global and the local classifiers by multiple instance learning (MIL), where the attention guidance, indicating roughly where the PDAC regions are, is the key to bridging them: For global MIL based normal/PDAC classification, attention serves as a weight for each instance (voxel) during MIL pooling, which eliminates the distraction from the background; For local MIL based semi-supervised PDAC segmentation, the attention guidance is inductive, which not only provides bag-level pseudo-labels to training data without per-voxel annotations for MIL training, but also acts as a proxy of an instance-level classifier.

Multiple Instance Learning Segmentation

Making CNNs Interpretable by Building Dynamic Sequential Decision Forests with Top-down Hierarchy Learning

no code implementations5 Jun 2021 Yilin Wang, Shaozuo Yu, Xiaokang Yang, Wei Shen

In this paper, we propose a generic model transfer scheme to make Convlutional Neural Networks (CNNs) interpretable, while maintaining their high classification accuracy.

Classification

An Efficient Group-based Search Engine Marketing System for E-Commerce

no code implementations24 Jun 2021 Cheng Jie, Da Xu, Zigeng Wang, Lu Wang, Wei Shen

With the increasing scale of search engine marketing, designing an efficient bidding system is becoming paramount for the success of e-commerce companies.

Marketing

Entity Linking Meets Deep Learning: Techniques and Solutions

no code implementations26 Sep 2021 Wei Shen, Yuhan Li, Yinan Liu, Jiawei Han, Jianyong Wang, Xiaojie Yuan

Entity linking (EL) is the process of linking entity mentions appearing in web text with their corresponding entities in a knowledge base.

Entity Linking Knowledge Base Population +2

Image BERT Pre-training with Online Tokenizer

no code implementations ICLR 2022 Jinghao Zhou, Chen Wei, Huiyu Wang, Wei Shen, Cihang Xie, Alan Yuille, Tao Kong

The success of language Transformers is primarily attributed to the pretext task of masked language modeling (MLM), where texts are first tokenized into semantically meaningful pieces.

Image Classification Instance Segmentation +5

Toward Tweet Entity Linking with Heterogeneous Information Networks

1 code implementation IEEE Transactions on Knowledge and Data Engineering 2021 Wei Shen, Yuwei Yin, Yang Yang, Jiawei Han, Jianyong Wang, Xiaojie Yuan

The task of linking an entity mention in a tweet with its corresponding entity in a heterogeneous information network is of great importance, for the purpose of enriching heterogeneous information networks with the abundant and fresh knowledge embedded in tweets.

Entity Linking Metric Learning

Consensus Synergizes with Memory: A Simple Approach for Anomaly Segmentation in Urban Scenes

no code implementations24 Nov 2021 Jiazhong Cen, Zenkun Jiang, Lingxi Xie, Qi Tian, Xiaokang Yang, Wei Shen

Anomaly segmentation is a crucial task for safety-critical applications, such as autonomous driving in urban scenes, where the goal is to detect out-of-distribution (OOD) objects with categories which are unseen during training.

Anomaly Detection Autonomous Driving +1

Geometric Synthesis: A Free lunch for Large-scale Palmprint Recognition Model Pretraining

no code implementations11 Mar 2022 Kai Zhao, Lei Shen, Yingyi Zhang, Chuhan Zhou, Tao Wang, Ruixin Zhang, Shouhong Ding, Wei Jia, Wei Shen

In this paper, by observing that palmar creases are the key information to deep-learning-based palmprint recognition, we propose to synthesize training data by manipulating palmar creases.

TAR

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

no code implementations20 Mar 2022 Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

Iwin Transformer is a hierarchical Transformer which progressively performs token representation learning and token agglomeration within irregular windows.

Human-Object Interaction Detection Object +4

Video-based Human-Object Interaction Detection from Tubelet Tokens

no code implementations4 Jun 2022 Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, Wei Shen

We present a novel vision Transformer, named TUTOR, which is able to learn tubelet tokens, served as highly-abstracted spatiotemporal representations, for video-based human-object interaction (V-HOI) detection.

Human-Object Interaction Detection

A Survey on Label-efficient Deep Image Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction

no code implementations4 Jul 2022 Wei Shen, Zelin Peng, Xuehui Wang, Huayu Wang, Jiazhong Cen, Dongsheng Jiang, Lingxi Xie, Xiaokang Yang, Qi Tian

Next, we summarize the existing label-efficient image segmentation methods from a unified perspective that discusses an important question: how to bridge the gap between weak supervision and dense prediction -- the current methods are mostly based on heuristic priors, such as cross-pixel similarity, cross-label constraint, cross-view consistency, and cross-image relation.

Image Segmentation Instance Segmentation +2

Enhancing Self-Attention with Knowledge-Assisted Attention Maps

no code implementations NAACL 2022 Jiangang Bai, Yujing Wang, Hong Sun, Ruonan Wu, Tianmeng Yang, Pengfei Tang, Defu Cao, Mingliang Zhang1, Yunhai Tong, Yaming Yang, Jing Bai, Ruofei Zhang, Hao Sun, Wei Shen

Large-scale pre-trained language models have attracted extensive attentions in the research community and shown promising results on various tasks of natural language processing.

Multi-Task Learning Natural Language Understanding

Learning Entity Linking Features for Emerging Entities

1 code implementation8 Aug 2022 Chenwei Ran, Wei Shen, Jianbo Gao, Yuhan Li, Jianyong Wang, Yantao Jia

Entity linking (EL) is the process of linking entity mentions appearing in text with their corresponding entities in a knowledge base.

Entity Linking

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance

no code implementations30 Aug 2022 Li Lyna Zhang, Youkow Homma, Yujing Wang, Min Wu, Mao Yang, Ruofei Zhang, Ting Cao, Wei Shen

Remarkably, under our latency requirement of 1900us on CPU, SwiftPruner achieves a 0. 86% higher AUC than the state-of-the-art uniform sparse baseline for BERT-Mini on a large scale real-world dataset.

TEDL: A Two-stage Evidential Deep Learning Method for Classification Uncertainty Quantification

no code implementations12 Sep 2022 Xue Li, Wei Shen, Denis Charles

In this paper, we propose TEDL, a two-stage learning approach to quantify uncertainty for deep learning models in classification tasks, inspired by our findings in experimenting with Evidential Deep Learning (EDL) method, a recently proposed uncertainty quantification approach based on the Dempster-Shafer theory.

Uncertainty Quantification

Personal Attribute Prediction from Conversations

1 code implementation29 Aug 2022 Yinan Liu, Hu Chen, Wei Shen

Personal knowledge bases (PKBs) are critical to many applications, such as Web-based chatbots and personalized recommendation.

Attribute Language Modelling

Low-resource Personal Attribute Prediction from Conversation

no code implementations28 Nov 2022 Yinan Liu, Hu Chen, Wei Shen, Jiaoyan Chen

Previous studies often rely on a relative number of resources such as labeled utterances and external data, yet the attribute knowledge embedded in unlabeled utterances is underutilized and their performance of predicting some difficult personal attributes is still unsatisfactory.

Attribute text-classification +1

Joint Open Knowledge Base Canonicalization and Linking

no code implementations Proceedings of the 2021 International Conference on Management of Data 2021 Yinan Liu, Wei Shen, Yuanfei Wang, Jianyong Wang, Zhenglu Yang, Xiaojie Yuan

However, noun phrases (NPs) and relation phrases (RPs) in OKBs are not canonicalized and often appear in different paraphrased textual variants, which leads to redundant and ambiguous facts.

Open Information Extraction Relation

A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS

no code implementations5 Dec 2022 Wei Shen, Xiaonan He, Chuheng Zhang, Xuyun Zhang, Jian Xie

Moreover, they are trained and evaluated on the benchmark datasets with adequate labels, which are expensive to obtain in a commercial dialogue system.

Spoken Dialogue Systems

Finding Lookalike Customers for E-Commerce Marketing

no code implementations9 Jan 2023 Yang Peng, Changzheng Liu, Wei Shen

Customer-centric marketing campaigns generate a large portion of e-commerce website traffic for Walmart.

Marketing

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning

no code implementations3 Mar 2023 Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang

Inspired by the recent success of sequence modeling in RL and the use of masked language model for pre-training, we propose a masked model for pre-training in RL, RePreM (Representation Pre-training with Masked Model), which trains the encoder combined with transformer blocks to predict the masked states or actions in a trajectory.

Data Augmentation Language Modelling +3

Bid Optimization for Offsite Display Ad Campaigns on eCommerce

no code implementations18 Jun 2023 Hangjian Li, Dong Xu, Konstantin Shmakov, Kuang-Chih Lee, Wei Shen

Online retailers often use third-party demand-side-platforms (DSPs) to conduct offsite advertising and reach shoppers across the Internet on behalf of their advertisers.

Agglomerative Transformer for Human-Object Interaction Detection

no code implementations ICCV 2023 Danyang Tu, Wei Sun, Guangtao Zhai, Wei Shen

We propose an agglomerative Transformer (AGER) that enables Transformer-based human-object interaction (HOI) detectors to flexibly exploit extra instance-level cues in a single-stage and end-to-end manner for the first time.

Clustering Human-Object Interaction Detection +1

SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction

no code implementations28 Aug 2023 Zelin Peng, Zhengqin Xu, Zhilin Zeng, Xiaokang Yang, Wei Shen

Most existing fine-tuning methods attempt to bridge the gaps among different scenarios by introducing a set of new parameters to modify SAM's original parameter space.

Segmentation Semantic Segmentation

Joint Gaze-Location and Gaze-Object Detection

no code implementations26 Aug 2023 Danyang Tu, Wei Shen, Wei Sun, Xiongkuo Min, Guangtao Zhai

In contrast, we reframe the gaze following detection task as detecting human head locations and their gaze followings simultaneously, aiming at jointly detect human gaze location and gaze object in a unified and single-stage pipeline.

Object object-detection +1

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

no code implementations8 Oct 2023 Wei Shen, Rui Zheng, WenYu Zhan, Jun Zhao, Shihan Dou, Tao Gui, Qi Zhang, Xuanjing Huang

Reinforcement learning from human feedback serves as a crucial bridge, aligning large language models with human and societal values.

Language Modelling

Stochastic Smoothed Gradient Descent Ascent for Federated Minimax Optimization

no code implementations2 Nov 2023 Wei Shen, Minhui Huang, Jiawei Zhang, Cong Shen

In recent years, federated minimax optimization has attracted growing interest due to its extensive applications in various machine learning tasks.

Federated Learning

All Data on the Table: Novel Dataset and Benchmark for Cross-Modality Scientific Information Extraction

no code implementations14 Nov 2023 Yuhan Li, Jian Wu, Zhiwei Yu, Börje F. Karlsson, Wei Shen, Manabu Okumura, Chin-Yew Lin

To close this gap in data availability and enable cross-modality IE, while alleviating labeling costs, we propose a semi-supervised pipeline for annotating entities in text, as well as entities and relations in tables, in an iterative procedure.

Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model

no code implementations28 Nov 2023 Zelin Peng, Zhengqin Xu, Zhilin Zeng, Lingxi Xie, Qi Tian, Wei Shen

Parameter-efficient fine-tuning (PEFT) is an effective methodology to unleash the potential of large foundation models in novel scenarios with limited training data.

Image Classification Image Segmentation +2

Segment Any 3D Gaussians

no code implementations1 Dec 2023 Jiazhong Cen, Jiemin Fang, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Interactive 3D segmentation in radiance fields is an appealing task since its importance in 3D scene understanding and manipulation.

Interactive Segmentation Scene Understanding +1

Partial Label Learning with a Partner

no code implementations18 Dec 2023 Chongjie Si, Zekun Jiang, Xuehui Wang, Yan Wang, Xiaokang Yang, Wei Shen

To help existing PLL methods identify and rectify mislabeled samples, in this paper, we introduce a novel partner classifier and propose a novel ``mutual supervision'' paradigm.

Partial Label Learning

Human-Instruction-Free LLM Self-Alignment with Limited Samples

no code implementations6 Jan 2024 Hongyi Guo, Yuanshun Yao, Wei Shen, Jiaheng Wei, Xiaoying Zhang, Zhaoran Wang, Yang Liu

The key idea is to first retrieve high-quality samples related to the target domain and use them as In-context Learning examples to generate more samples.

In-Context Learning Instruction Following

ViTree: Single-path Neural Tree for Step-wise Interpretable Fine-grained Visual Categorization

no code implementations30 Jan 2024 Danning Lao, Qi Liu, Jiazi Bu, Junchi Yan, Wei Shen

As computer vision continues to advance and finds widespread applications across various domains, the need for interpretability in deep learning models becomes paramount.

Decision Making Fine-Grained Visual Categorization

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

no code implementations8 Mar 2024 Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

We introduce Adversarial Policy Optimization (AdvPO), a novel solution to the pervasive issue of reward over-optimization in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs).

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

no code implementations12 Mar 2024 Wei Shen, Xiaoying Zhang, Yuanshun Yao, Rui Zheng, Hongyi Guo, Yang Liu

Reinforcement learning from human feedback (RLHF) is the mainstream paradigm used to align large language models (LLMs) with human preferences.

reinforcement-learning

EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting

no code implementations22 Mar 2024 Kailing Wang, Chen Yang, Yuehao Wang, Sikuang Li, Yan Wang, Qi Dou, Xiaokang Yang, Wei Shen

Precise camera tracking, high-fidelity 3D tissue reconstruction, and real-time online visualization are critical for intrabody medical imaging devices such as endoscopes and capsule robots.

Simultaneous Localization and Mapping

Cannot find the paper you are looking for? You can Submit a new open access paper.