Search Results for author: Qian Yu

Found 71 papers, 30 papers with code

Attention-based Extraction of Structured Information from Street View Imagery

3 code implementations 11 Apr 2017 Zbigniew Wojna, Alex Gorban, Dar-Shyang Lee, Kevin Murphy, Qian Yu, Yeqing Li, Julian Ibarz

We present a neural network model - based on CNNs, RNNs and a novel attention mechanism - which achieves 84.2% accuracy on the challenging French Street Name Signs (FSNS) dataset, significantly outperforming the previous state of the art (Smith'16), which achieved 72.46%.

Optical Character Recognition (OCR)

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

1 code implementation NeurIPS 2023 XiMing Xing, Chuang Wang, Haitao Zhou, Jing Zhang, Qian Yu, Dong Xu

Even though trained mainly on images, we discover that pretrained diffusion models show impressive power in guiding sketch synthesis.

Multi-view Aggregation Network for Dichotomous Image Segmentation

2 code implementations 11 Apr 2024 Qian Yu, Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu

Dichotomous Image Segmentation (DIS) has recently emerged towards high-precision object segmentation from high-resolution natural images.

Dichotomous Image Segmentation Image Segmentation +1

SketchyScene: Richly-Annotated Scene Sketches

2 code implementations ECCV 2018 Changqing Zou, Qian Yu, Ruofei Du, Haoran Mo, Yi-Zhe Song, Tao Xiang, Chengying Gao, Baoquan Chen, Hao Zhang

We contribute the first large-scale dataset of scene sketches, SketchyScene, with the goal of advancing research on sketch understanding at both the object and scene level.

Colorization Image Retrieval +2

SVGDreamer: Text Guided SVG Generation with Diffusion Model

1 code implementation 27 Dec 2023 XiMing Xing, Haitao Zhou, Chuang Wang, Jing Zhang, Dong Xu, Qian Yu

However, existing text-to-SVG generation methods lack editability and struggle with visual quality and result diversity.

Vector Graphics

Orthogonal Annotation Benefits Barely-supervised Medical Image Segmentation

1 code implementation CVPR 2023 Heng Cai, Shumeng Li, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

Subsequently, by introducing unlabeled volumes, we propose a dual-network paradigm named Dense-Sparse Co-training (DeSCO) that exploits dense pseudo labels in early stage and sparse labels in later stage and meanwhile forces consistent output of two networks.

Image Segmentation Semantic Segmentation +1

HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

1 code implementation 10 Mar 2023 Jie zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu

Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture.

Multi-Task Learning Recommendation Systems

Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation

1 code implementation 17 Oct 2021 Yinghuan Shi, Jian Zhang, Tong Ling, Jiwen Lu, Yefeng Zheng, Qian Yu, Lei Qi, Yang Gao

In semi-supervised medical image segmentation, most previous works draw on the common assumption that higher entropy means higher uncertainty.

Image Segmentation Segmentation +2

Unsupervised Sketch-to-Photo Synthesis

1 code implementation 18 Sep 2019 Runtao Liu, Qian Yu, Stella Yu

Humans can envision a realistic photo given a free-hand sketch that is not only spatially imprecise and geometrically distorted but also without colors and visual details.

Colorization Data Augmentation +5

3D Shape Reconstruction from Free-Hand Sketches

1 code implementation 17 Jun 2020 Jiayun Wang, Jierui Lin, Qian Yu, Runtao Liu, Yubei Chen, Stella X. Yu

Additionally, we propose a sketch standardization module to handle different sketch distortions and styles.

3D Reconstruction 3D Shape Reconstruction

Feature Decomposition for Reducing Negative Transfer: A Novel Multi-task Learning Method for Recommender System

1 code implementation 10 Feb 2023 Jie zhou, Qian Yu, Chuan Luo, Jing Zhang

In recent years, thanks to the rapid development of deep learning (DL), DL-based multi-task learning (MTL) has made significant progress, and it has been successfully applied to recommendation systems (RS).

Multi-Task Learning Recommendation Systems

Sketch-a-Net that Beats Humans

2 code implementations 30 Jan 2015 Qian Yu, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy Hospedales

We propose a multi-scale multi-channel deep neural network framework that, for the first time, yields sketch recognition performance surpassing that of humans.

Sketch Recognition

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

1 code implementation 6 Sep 2023 Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu

Pre-trained text-image discriminative models, such as CLIP, have been explored for open-vocabulary semantic segmentation with unsatisfactory results due to the loss of crucial localization information and awareness of object shapes.

Contrastive Learning Denoising +5

IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument Mining Tasks

1 code implementation ACL 2022 Liying Cheng, Lidong Bing, Ruidan He, Qian Yu, Yan Zhang, Luo Si

Traditionally, a debate usually requires a manual preparation process, including reading plenty of articles, selecting the claims, identifying the stances of the claims, seeking the evidence for the claims, etc.

Claim-Evidence Pair Extraction (CEPE) Claim Extraction with Stance Classification (CESC) +1

SketchSampler: Sketch-based 3D Reconstruction via View-dependent Depth Sampling

1 code implementation 14 Aug 2022 Chenjian Gao, Qian Yu, Lu Sheng, Yi-Zhe Song, Dong Xu

Reconstructing a 3D shape based on a single sketch image is challenging due to the large domain gap between a sparse, irregular sketch and a regular, dense 3D shape.

3D Reconstruction

3D Medical Image Segmentation with Sparse Annotation via Cross-Teaching between 3D and 2D Networks

1 code implementation 30 Jul 2023 Heng Cai, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

Our experimental results on the MMWHS dataset demonstrate that our method outperforms the state-of-the-art (SOTA) semi-supervised segmentation methods.

Image Segmentation Medical Image Segmentation +3

Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training

1 code implementation 15 Aug 2023 XiMing Xing, Chuang Wang, Haitao Zhou, Zhihao Hu, Chongxuan Li, Dong Xu, Qian Yu

In the full-control inversion process, we propose an appearance-energy function to control the color and texture of the final generated photo. Importantly, our Inversion-by-Inversion pipeline is training-free and can accept different types of exemplars for color and texture control.

Image Generation

Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport

1 code implementation 28 Feb 2024 Bin Li, Ye Shi, Qian Yu, Jingya Wang

This paper introduces ProtoOT, a novel Optimal Transport formulation explicitly tailored for UCIR, which integrates intra-domain feature representation learning and cross-domain alignment into a unified framework.

Contrastive Learning Image Retrieval +2

Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

1 code implementation 24 Jul 2021 Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi, Wuzhang Wang, Yang Gao

Together, the above two schemes give rise to a novel double-branch encoder segmentation framework for medical image segmentation, namely Crosslink-Net.

Image Segmentation Medical Image Segmentation +2

Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline

1 code implementation 5 Dec 2023 Xiaoqi Zhao, Youwei Pang, Zhenyu Chen, Qian Yu, Lihe Zhang, Hanqi Liu, Jiaming Zuo, Huchuan Lu

We conduct a comprehensive study on a new task named power battery detection (PBD), which aims to localize the dense cathode and anode plates endpoints from X-ray images to evaluate the quality of power batteries.

Crowd Counting object-detection +2

PLN: Parasitic-Like Network for Barely Supervised Medical Image Segmentation

1 code implementation IEEE Transactions on Medical Imaging 2022 Shumeng Li, Heng Cai, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

In this paper, by introducing an extremely sparse annotation way of labeling only one slice per 3D image, we investigate a novel barely-supervised segmentation setting with only a few sparsely-labeled images along with a large amount of unlabeled images.

Image Segmentation Medical Image Segmentation +2

Rethinking Large-scale Pre-ranking System: Entire-chain Cross-domain Models

1 code implementation 12 Oct 2023 Jinbo Song, Ruoran Huang, Xinyang Wang, Wei Huang, Qian Yu, Mingming Chen, Yafei Yao, Chaosheng Fan, Changping Peng, Zhangang Lin, Jinghe Hu, Jingping Shao

Industrial systems such as recommender systems and online advertising have been widely equipped with multi-stage architectures, which are divided into several cascaded modules, including matching, pre-ranking, ranking and re-ranking.

Recommendation Systems Re-Ranking +1

Concatenate, Fine-tuning, Re-training: A SAM-enabled Framework for Semi-supervised 3D Medical Image Segmentation

1 code implementation 17 Mar 2024 Shumeng Li, Lei Qi, Qian Yu, Jing Huo, Yinghuan Shi, Yang Gao

Segment Anything Model (SAM) fine-tuning has shown remarkable performance in medical image segmentation in a fully supervised manner, but requires precise annotations.

Image Segmentation Segmentation +2

Polynomial Codes: an Optimal Design for High-Dimensional Coded Matrix Multiplication

3 code implementations NeurIPS 2017 Qian Yu, Mohammad Ali Maddah-Ali, A. Salman Avestimehr

We consider a large-scale matrix multiplication problem where the computation is carried out using a distributed system with a master node and multiple worker nodes, where each worker can store parts of the input matrices.

Information Theory Distributed, Parallel, and Cluster Computing

Distortion-aware Transformer in 360° Salient Object Detection

1 code implementation 7 Aug 2023 Yinjie Zhao, Lichen Zhao, Qian Yu, Jing Zhang, Lu Sheng, Dong Xu

The first is a Distortion Mapping Module, which guides the model to pre-adapt to distorted features globally.

ERP Object +3

Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation

1 code implementation 13 Apr 2024 Qinghe Ma, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

To fully utilize the information within the intermediate domain, we propose a symmetric Guidance training strategy (SymGD), which additionally offers direct guidance to unlabeled data by merging pseudo labels from intermediate samples.

Image Segmentation Segmentation +4

Lagrange Coded Computing: Optimal Design for Resiliency, Security and Privacy

no code implementations 4 Jun 2018 Qian Yu, Songze Li, Netanel Raviv, Seyed Mohammadreza Mousavi Kalan, Mahdi Soltanolkotabi, Salman Avestimehr

We consider a scenario involving computations over a massive dataset stored distributedly across multiple workers, which is at the core of distributed learning algorithms.

Polynomially Coded Regression: Optimal Straggler Mitigation via Data Encoding

no code implementations 24 May 2018 Songze Li, Seyed Mohammadreza Mousavi Kalan, Qian Yu, Mahdi Soltanolkotabi, A. Salman Avestimehr

In particular, PCR requires a recovery threshold that scales inversely proportionally with the amount of computation/storage available at each worker.

regression

Crossbar-Net: A Novel Convolutional Network for Kidney Tumor Segmentation in CT Images

no code implementations 27 Apr 2018 Qian Yu, Yinghuan Shi, Jinquan Sun, Yang Gao, Yakang Dai, Jianbing Zhu

Due to the irregular motion, similar appearance and diverse shape, accurate segmentation of kidney tumor in CT images is a difficult and challenging task.

Cardiac Segmentation Segmentation +1

Network Transplanting

no code implementations 26 Apr 2018 Quanshi Zhang, Yu Yang, Qian Yu, Ying Nian Wu

This paper focuses on a new task, i.e., transplanting a category-and-task-specific neural network to a generic, modular network without strong supervision.

Coded Fourier Transform

no code implementations 17 Oct 2017 Qian Yu, Mohammad Ali Maddah-Ali, A. Salman Avestimehr

We consider the problem of computing the Fourier transform of high-dimensional vectors, distributedly over a cluster of machines consisting of a master node and multiple worker nodes, where the worker nodes can only store and process a fraction of the inputs.

Large Scale Business Discovery from Street Level Imagery

no code implementations 17 Dec 2015 Qian Yu, Christian Szegedy, Martin C. Stumpe, Liron Yatziv, Vinay Shet, Julian Ibarz, Sacha Arnoud

Precise business store front detection enables accurate geo-location of businesses, and further provides input for business categorization, listing generation, etc.

Understanding Boltzmann Machine and Deep Learning via A Confident Information First Principle

no code implementations 16 Feb 2013 Xiaozhao Zhao, Yuexian Hou, Qian Yu, Dawei Song, Wenjie Li

Typical dimensionality reduction methods focus on directly reducing the number of random variables while retaining maximal variations in the data.

Density Estimation Dimensionality Reduction

Responding E-commerce Product Questions via Exploiting QA Collections and Reviews

no code implementations COLING 2018 Qian Yu, Wai Lam, ZiHao Wang

Providing instant responses for product questions in E-commerce sites can significantly improve satisfaction of potential consumers.

Learning-To-Rank Sentence

Network Transplanting (extended abstract)

no code implementations 21 Jan 2019 Quanshi Zhang, Yu Yang, Qian Yu, Ying Nian Wu

This paper focuses on a new task, i.e., transplanting a category-and-task-specific neural network to a generic, modular network without strong supervision.

Recognizing Activities via Bag of Words for Attribute Dynamics

no code implementations CVPR 2013 Weixin Li, Qian Yu, Harpreet Sawhney, Nuno Vasconcelos

A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes.

Activity Recognition Attribute

Pedestrian Detection in Low-resolution Imagery by Learning Multi-scale Intrinsic Motion Structures (MIMS)

no code implementations CVPR 2014 Jiejie Zhu, Omar Javed, Jingen Liu, Qian Yu, Hui Cheng, Harpreet Sawhney

In this paper, we give a comparative evaluation of the proposed method and demonstrate that MIMS outperforms the state of the art approaches in identifying pedestrians from low resolution airborne videos.

Optical Flow Estimation Pedestrian Detection

Sketch Me That Shoe

no code implementations CVPR 2016 Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Chen-Change Loy

We investigate the problem of fine-grained sketch-based image retrieval (SBIR), where free-hand human sketches are used as queries to perform instance-level retrieval of images.

Data Augmentation Retrieval +1

Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition

no code implementations NeurIPS 2020 Lin Chen, Qian Yu, Hannah Lawrence, Amin Karbasi

To establish the dimension-independent upper bound, we next show that a mini-batching algorithm provides an $ O(\frac{T}{\sqrt{K}}) $ upper bound, and therefore conclude that the minimax regret of switching-constrained OCO is $ \Theta(\frac{T}{\sqrt{K}}) $ for any $K$.

2k

Review-based Question Generation with Adaptive Instance Transfer and Augmentation

no code implementations ACL 2020 Qian Yu, Lidong Bing, Qiong Zhang, Wai Lam, Luo Si

We propose an iterative learning framework for handling this challenge via adaptive transfer and augmentation of the training instances with the help of the available user-posed question-answer data.

Question Generation Question-Generation

Crossover-Net: Leveraging the Vertical-Horizontal Crossover Relation for Robust Segmentation

no code implementations 3 Apr 2020 Qian Yu, Yinghuan Shi, Yefeng Zheng, Yang Gao, Jianbing Zhu, Yakang Dai

Robust segmentation for non-elongated tissues in medical images is hard to realize due to the large variation of the shape, size, and appearance of these tissues in different patients.

Relation Segmentation

Unsupervised Sketch to Photo Synthesis

no code implementations ECCV 2020 Runtao Liu, Qian Yu, Stella X. Yu

Humans can envision a realistic photo given a free-hand sketch that is not only spatially imprecise and geometrically distorted but also without colors and visual details.

Denoising Retrieval +2

Deep Symmetric Adaptation Network for Cross-modality Medical Image Segmentation

no code implementations 18 Jan 2021 Xiaoting Han, Lei Qi, Qian Yu, Ziqi Zhou, Yefeng Zheng, Yinghuan Shi, Yang Gao

These typical methods usually utilize a translation network to transform images from the source domain to target domain or train the pixel-level classifier merely using translated source images and original target images.

Image Segmentation Medical Image Segmentation +4

Answering Product-related Questions with Heterogeneous Information

no code implementations Asian Chapter of the Association for Computational Linguistics 2020 Wenxuan Zhang, Qian Yu, Wai Lam

Providing instant response for product-related questions in E-commerce question answering platforms can greatly improve users' online shopping experience.

Attribute Question Answering

Feature Cross Search via Submodular Optimization

no code implementations 5 Jul 2021 Lin Chen, Hossein Esfandiari, Gang Fu, Vahab S. Mirrokni, Qian Yu

First, we show that it is not possible to provide an $n^{1/\log\log n}$-approximation algorithm for this problem unless the exponential time hypothesis fails.

Feature Engineering

LightSecAgg: a Lightweight and Versatile Design for Secure Aggregation in Federated Learning

no code implementations 29 Sep 2021 Jinhyun So, Chaoyang He, Chien-Sheng Yang, Songze Li, Qian Yu, Ramy E. Ali, Basak Guler, Salman Avestimehr

We also demonstrate that, unlike existing schemes, LightSecAgg can be applied to secure aggregation in the asynchronous FL setting.

Federated Learning

STVGBert: A Visual-Linguistic Transformer Based Framework for Spatio-Temporal Video Grounding

no code implementations ICCV 2021 Rui Su, Qian Yu, Dong Xu

Spatio-temporal video grounding (STVG) aims to localize a spatio-temporal tube of a target object in an untrimmed video based on a query sentence.

Object Sentence +2

Action based Network for Conversation Question Reformulation

no code implementations 29 Nov 2021 Zheyu Ye, Jiangning Liu, Qian Yu, Jianxun Ju

Conversation question answering requires the ability to interpret a question correctly.

Question Answering

Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization

no code implementations 21 Nov 2022 Weiqi Sun, Rui Su, Qian Yu, Dong Xu

Weakly supervised temporal action localization (WTAL) aims to localize actions in untrimmed videos with only weak supervision information (e.g., video-level labels).

Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization

The Strategy Evolution in Double Auction Based on the Experience-Weighted Attraction Learning Model

no code implementations IEEE Access (Volume: 7) 2019 Qian Yu, Yaqin Liu, De Xia, Luis Martínez

The double auction is a widely applicable trading mechanism used to converge to competitive equilibrium in different markets from which multiple equilibriums and incomplete information may arise.

Product Question Answering in E-Commerce: A Survey

no code implementations 16 Feb 2023 Yang Deng, Wenxuan Zhang, Qian Yu, Wai Lam

Product question answering (PQA), aiming to automatically provide instant responses to customer's questions in E-Commerce platforms, has drawn increasing attention in recent years.

Question Answering

Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms

no code implementations NeurIPS 2023 Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

We consider a fundamental setting in which the objective function is quadratic, and provide the first tight characterization of the optimal Hessian-dependent sample complexity.

valid

DCRNN: A Deep Cross approach based on RNN for Partial Parameter Sharing in Multi-task Learning

no code implementations 18 Oct 2023 Jie zhou, Qian Yu

The model has three innovations: 1) It adopts the idea of a cross network and uses an RNN to cross-process the features, thereby effectively improving the expressive ability of the model; 2) It proposes a structure of partial parameter sharing; 3) It can effectively capture the potential correlation between different tasks to optimize the efficiency and methods for learning different tasks.

Multi-Task Learning Recommendation Systems

Data Contamination Issues in Brain-to-Text Decoding

no code implementations 18 Dec 2023 Congchi Yin, Qian Yu, Zhiwei Fang, Jie He, Changping Peng, Zhangang Lin, Jingping Shao, Piji Li

Decoding non-invasive cognitive signals to natural language has long been the goal of building practical brain-computer interfaces (BCIs).

EEG

An Incremental Update Framework for Online Recommenders with Data-Driven Prior

no code implementations 26 Dec 2023 Chen Yang, Jin Chen, Qian Yu, Xiangdong Wu, Kui Ma, Zihao Zhao, Zhiwei Fang, Wenlong Chen, Chaosheng Fan, Jie He, Changping Peng, Zhangang Lin, Jingping Shao

To address the aforementioned issue, we propose an incremental update framework for online recommenders with Data-Driven Prior (DDP), which is composed of Feature Prior (FP) and Model Prior (MP).

Continual Learning

Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation

1 code implementation 27 Dec 2023 Xiawei Li, Qingyuan Xu, Jing Zhang, Tianyi Zhang, Qian Yu, Lu Sheng, Dong Xu

The point affinity proposed in this paper is characterized by features from multiple modalities (e.g., point cloud and RGB), and is further refined by normalizing the classifier weights to alleviate the detrimental effects of long-tailed distribution without the need of the prior of category distribution.

3D Semantic Segmentation Point Cloud Segmentation +1

Data-Free Generalized Zero-Shot Learning

no code implementations 28 Jan 2024 Bowen Tang, Long Yan, Jing Zhang, Qian Yu, Lu Sheng, Dong Xu

Firstly, to recover the virtual features of the base data, we model the CLIP features of base class images as samples from a von Mises-Fisher (vMF) distribution based on the pre-trained classifier.

Generalized Zero-Shot Learning Zero-shot Generalization

GenesisTex: Adapting Image Denoising Diffusion to Texture Space

no code implementations 26 Mar 2024 Chenjian Gao, Boyan Jiang, Xinghui Li, Yingpeng Zhang, Qian Yu

We present GenesisTex, a novel method for synthesizing textures for 3D geometries from text descriptions.

Image Denoising
