Search Results for author: Jian Wu

Found 81 papers, 28 papers with code

SmartCiteCon: Implicit Citation Context Extraction from Academic Literature Using Supervised Learning

no code implementations WOSP 2020 Chenrui Guo, Haoran Cui, Li Zhang, Jiamin Wang, Wei Lu, Jian Wu

The tool is built on a Support Vector Machine (SVM) model trained on a set of 7,058 manually annotated citation context sentences, curated from 34,000 papers from the ACL Anthology.

Acknowledgement Entity Recognition in CORD-19 Papers

1 code implementation EMNLP (sdp) 2020 Jian Wu, Pei Wang, Xin Wei, Sarah Rajtmajer, C. Lee Giles, Christopher Griffin

We built a supplementary database by linking CORD-19 papers with acknowledgement entities extracted by AckExtract including persons and organizations and find that only up to 50–60% of named entities are actually acknowledged.

Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation

1 code implementation 6 May 2022 Zui Chen, Yansen Jing, Shengcheng Yuan, Yifei Xu, Jian Wu, Hang Zhao

Synthesizer is a type of electronic musical instrument that is now widely used in modern music production and sound design.

Audio Classification Audio Signal Processing

SciEv: Finding Scientific Evidence Papers for Scientific News

no code implementations 30 Apr 2022 Md Reshad Ul Hoque, Jiang Li, Jian Wu

To our best knowledge, this is the first dataset of this kind.

Ultra Fast Speech Separation Model with Teacher Student Learning

no code implementations 27 Apr 2022 Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu

In this paper, an ultra fast speech separation Transformer model is proposed to achieve both better performance and efficiency with teacher student learning (T-S learning).

Speech Separation

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?

no code implementations 27 Apr 2022 Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei

Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition.

Self-Supervised Learning Speaker Recognition +2

Online Deep Learning from Doubly-Streaming Data

1 code implementation 25 Apr 2022 Heng Lian, John Scovil Atwood, BoJian Hou, Jian Wu, Yi He

This paper investigates a new online learning problem with doubly-streaming data, where the data streams are described by feature spaces that constantly evolve, with new features emerging and old features fading away.

online learning

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

1 code implementation 6 Apr 2022 Zhuangwei Shi, Yang Hu, Guangliang Mo, Jian Wu

Due to the complex volatility of the stock market, the research and prediction on the change of the stock price, can avoid the risk for the investors.

Stock Prediction Time Series

Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings

no code implementations 30 Mar 2022 Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka

The proposed speaker embedding, named t-vector, is extracted synchronously with the t-SOT ASR model, enabling joint execution of speaker identification (SID) or speaker diarization (SD) with the multi-talker transcription with low latency.

Automatic Speech Recognition Speaker Diarization +1

Large-Scale 3D Semantic Reconstruction for Automated Driving Vehicles with Adaptive Truncated Signed Distance Function

no code implementations 28 Feb 2022 Haohao Hu, Hexing Yang, Jian Wu, Xiao Lei, Frank Bieder, Jan-Hendrik Pauls, Christoph Stiller

Since a 3D surface can be usually observed from multiple camera images with different view poses, an optimal image patch selection for the texturing and an optimal semantic class estimation for the semantic mapping are still challenging.

3D Reconstruction

DialMed: A Dataset for Dialogue-based Medication Recommendation

1 code implementation 22 Feb 2022 Zhenfeng He, Yuqiang Han, Zhenqiu Ouyang, Wei Gao, Hongxu Chen, Guandong Xu, Jian Wu

Therefore, we make the first attempt to recommend medications with the conversations between doctors and patients.

Graph Attention

A State-of-the-art Survey of U-Net in Microscopic Image Analysis: from Simple Usage to Structure Mortification

no code implementations 14 Feb 2022 Jian Wu, Wanli Liu, Chen Li, Tao Jiang, Islam Mohammad Shariful, Hongzan Sun, Xiaoqi Li, Xintong Li, Xinyu Huang, Marcin Grzegorzek

Image analysis technology is used to solve the inadvertences of artificial traditional methods in disease, wastewater treatment, environmental change monitoring analysis and convolutional neural networks (CNN) play an important role in microscopic image analysis.

Semantic Segmentation

Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study

no code implementations 7 Feb 2022 Daniel Tompkins, Kshitiz Kumar, Jian Wu

An Xception model reaches state-of-the-art (SOTA) accuracy on the ESC-50 dataset for audio event detection through knowledge transfer from ImageNet weights, pretraining on AudioSet, and an on-the-fly data augmentation pipeline.

Data Augmentation Event Detection +1

Streaming Multi-Talker ASR with Token-Level Serialized Output Training

no code implementations 2 Feb 2022 Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka

This paper proposes a token-level serialized output training (t-SOT), a novel framework for streaming multi-talker automatic speech recognition (ASR).

Automatic Speech Recognition

What Can Machine Vision Do for Lymphatic Histopathology Image Analysis: A Comprehensive Review

no code implementations 21 Jan 2022 Xiaoqi Li, HaoYuan Chen, Chen Li, Md Mamunur Rahaman, Xintong Li, Jian Wu, Xiaoyan Li, Hongzan Sun, Marcin Grzegorzek

In the past ten years, the computing power of machine vision (MV) has been continuously improved, and image analysis algorithms have developed rapidly.

D-Former: A U-shaped Dilated Transformer for 3D Medical Image Segmentation

no code implementations 3 Jan 2022 Yixuan Wu, Kuanlun Liao, Jintai Chen, Jinhong Wang, Danny Z. Chen, Honghao Gao, Jian Wu

In this paper, we propose a new method called Dilated Transformer, which conducts self-attention for pair-wise patch relations captured alternately in local and global scopes.

Medical Image Segmentation Semantic Segmentation

AGMI: Attention-Guided Multi-omics Integration for Drug Response Prediction with Graph Neural Networks

1 code implementation 15 Dec 2021 Ruiwei Feng, Yufeng Xie, Minshan Lai, Danny Z. Chen, Ji Cao, Jian Wu

Accurate drug response prediction (DRP) is a crucial yet challenging task in precision medicine.

DANets: Deep Abstract Networks for Tabular Data Classification and Regression

1 code implementation 6 Dec 2021 Jintai Chen, Kuanlun Liao, Yao Wan, Danny Z. Chen, Jian Wu

A special basic block is built using AbstLays, and we construct a family of Deep Abstract Networks (DANets) for tabular data classification and regression by stacking such blocks.

Continuous Speech Separation with Recurrent Selective Attention Network

no code implementations 28 Oct 2021 Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li

In this paper, we propose to apply recurrent selective attention network (RSAN) to CSS, which generates a variable number of output channels based on active speaker counting.

Speech Recognition Speech Separation

A Neural Network-Based Linguistic Similarity Measure for Entrainment in Conversations

no code implementations 4 Sep 2021 Mingzhi Yu, Diane Litman, Shuang Ma, Jian Wu

Then we use the model to perform similarity measure in a corpus-based entrainment analysis.

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

no code implementations 6 Jul 2021 Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Our evaluation on the AMI meeting corpus reveals that after fine-tuning with a small real data, the joint system performs 8.9–29.9% better in accuracy compared to the best modular system while the modular system performs better before such fine-tuning.

Automatic Speech Recognition Representation Learning +2

Investigation of Practical Aspects of Single Channel Speech Separation for ASR

no code implementations 5 Jul 2021 Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li

Speech separation has been successfully applied as a frontend processing module of conversation transcription systems thanks to its ability to handle overlapped speech and its flexibility to combine with downstream tasks such as automatic speech recognition (ASR).

Automatic Speech Recognition Model Compression +1

Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

1 code implementation 1 Jul 2021 Muntabir Hasan Choudhury, Himarsha R. Jayanetti, Jian Wu, William A. Ingram, Edward A. Fox

Our experiments show that CRF with visual features outperformed both a heuristic and a CRF model with only text-based features.

Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models

no code implementations 30 Jun 2021 Amber Afshan, Kshitiz Kumar, Jian Wu

We propose a cost-effective method of using CC scores to select an optimal adaptation data set, where we maximize ASR gains from minimal data.

Automatic Speech Recognition

Extractive Research Slide Generation Using Windowed Labeling Ranking

1 code implementation NAACL (sdp) 2021 Athar Sefid, Jian Wu, Prasenjit Mitra, Lee Giles

Presentation slides describing the content of scientific and technical papers are an efficient and effective way to present that work.

Extractive Summarization

Document Domain Randomization for Deep Learning Document Layout Extraction

no code implementations 20 May 2021 Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles

We present document domain randomization (DDR), the first successful transfer of convolutional neural networks (CNNs) trained only on graphically rendered pseudo-paper pages to real-world document segmentation.

Electrocardio Panorama: Synthesizing New ECG Views with Self-supervision

1 code implementation 12 May 2021 Jintai Chen, Xiangshang Zheng, Hongyun Yu, Danny Z. Chen, Jian Wu

For the first time, we propose a new concept, Electrocardio Panorama, which allows visualizing ECG signals from any queried viewpoints.

Self-Supervised Learning

Doctor Imitator: A Graph-based Bone Age Assessment Framework Using Hand Radiographs

no code implementations 10 Feb 2021 Jintai Chen, Bohan Yu, Biwen Lei, Ruiwei Feng, Danny Z. Chen, Jian Wu

The architecture of DI is designed to learn the diagnostic logistics of doctors using the scoring methods (e.g., the Tanner-Whitehouse method) for bone age assessment.

Flow-Mixup: Classifying Multi-labeled Medical Images with Corrupted Labels

no code implementations 9 Feb 2021 Jintai Chen, Hongyun Yu, Ruiwei Feng, Danny Z. Chen, Jian Wu

In clinical practice, medical image interpretation often involves multi-labeled classification, since the affected parts of a patient tend to present multiple symptoms or comorbidities.

Image Classification

Speaker attribution with voice profiles by graph-based semi-supervised learning

no code implementations 6 Feb 2021 Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno

Speaker attribution is required in many real-world applications, such as meeting transcription, where speaker identity is assigned to each utterance according to speaker voice profiles.

Speaker Identification

Modeling Updates of Scholarly Webpages Using Archived Data

no code implementations 7 Dec 2020 Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles

The vastness of the web imposes a prohibitive cost on building large-scale search engines with limited resources.

Reconstruction Condition of Quantized Signals in Unlimited Sampling Framework

no code implementations 29 Nov 2020 Yan He, Jifang Qiu, Chang Liu, Yue Liu, Jian Wu

The latest theoretical advances in the field of unlimited sampling framework (USF) show the potential to avoid clipping problems of analog-to-digital converters (ADC).


IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines

1 code implementation 4 Nov 2020 Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, Shimin Zhang, Lei Xie, DongYan Huang, Hui Bu, Petr Motlicek, Jean-Marc Odobez

In this challenge, we open source a sizable speech, keyword, echo and noise corpus for promoting data-driven methods, particularly deep-learning approaches on KWS and SSL.

Sound Audio and Speech Processing

Multi-View Adaptive Fusion Network for 3D Object Detection

1 code implementation 2 Nov 2020 Guojun Wang, Bin Tian, Yachen Zhang, Long Chen, Dongpu Cao, Jian Wu

3D object detection based on LiDAR-camera fusion is becoming an emerging research theme for autonomous driving.

3D Object Detection Autonomous Driving +1

Dynamic radiomics: a new methodology to extract quantitative time-related features from tomographic images

no code implementations 1 Nov 2020 Fengying Che, Ruichuan Shi, Jian Wu, Haoran Li, Shuqin Li, Weixing Chen, Hao Zhang, Zhi Li, Xiaoyu Cui

The feature extraction methods of radiomics are mainly based on static tomographic images at a certain moment, while the occurrence and development of disease is a dynamic process that cannot be fully reflected by only static characteristics.

An End-to-end Architecture of Online Multi-channel Speech Separation

no code implementations 7 Sep 2020 Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie

Previously, we introduced a system, called unmixing, fixed-beamformer and extraction (UFE), that was shown to be effective in addressing the speech overlap problem in conversation transcription.

Speech Recognition Speech Separation

Preference Robust Optimization with Quasi-Concave Choice Functions for Multi-Attribute Prospects

no code implementations 31 Aug 2020 Jian Wu, William B. Haskell, Wenjie Huang, Huifu Xu

Preference robust choice models concern decision-making problems where the decision maker's (DM) utility/risk preferences are ambiguous and the evaluation is based on the worst-case utility function/risk measure from a set of plausible utility functions/risk measures.

Decision Making Portfolio Optimization

Echoes in Unidirectionally Rotating Molecules

no code implementations 15 Aug 2020 Long Xu, Ilia Tutunnikov, Lianrong Zhou, Kang Lin, Junjie Qiang, Peifen Lu, Yehiam Prior, Ilya Sh. Averbukh, Jian Wu

We report the experimental observation of molecular unidirectional rotation (UDR) echoes, and analyze their origin and behavior both classically and quantum mechanically.


Continuous Speech Separation with Conformer

1 code implementation 13 Aug 2020 Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou

Continuous speech separation plays a vital role in complicated speech related tasks such as conversation transcription.

Speech Separation

Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music

1 code implementation 12 Aug 2020 Haohe Liu, Lei Xie, Jian Wu, Geng Yang

We aim to address the major issues in CNN-based high-resolution MSS model: high computational cost and weight sharing between distinctly different bands.

Audio and Speech Processing Sound

DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement

7 code implementations Interspeech 2020 Yanxin Hu, Yun Liu, Shubo Lv, Mengtao Xing, Shimin Zhang, Yihui Fu, Jian Wu, Bihong Zhang, Lei Xie

Speech enhancement has benefited from the success of deep learning in terms of intelligibility and perceptual quality.

Speech Enhancement Audio and Speech Processing Sound

Large Scale Subject Category Classification of Scholarly Papers with Deep Attentive Neural Networks

no code implementations 27 Jul 2020 Bharath Kandimalla, Shaurya Rohatgi, Jian Wu, C. Lee Giles

The results showed the importance of retraining word embedding models to maximize the vocabulary overlap and the effectiveness of the attention mechanism.

General Classification

CenterNet3D: An Anchor Free Object Detector for Point Cloud

2 code implementations 13 Jul 2020 Guojun Wang, Jian Wu, Bin Tian, Siyu Teng, Long Chen, Dongpu Cao

However, because inherent sparsity of point clouds, 3D object center points are likely to be in empty space which makes it difficult to estimate accurate boundaries.

3D Object Detection Autonomous Driving

Speaker diarization with session-level speaker embedding refinement using graph neural networks

no code implementations 22 May 2020 Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno

Deep speaker embedding models have been commonly used as a building block for speaker diarization systems; however, the speaker embedding model is usually trained according to a global loss defined on the training data, which could be sub-optimal for distinguishing speakers locally in a specific meeting session.

Speaker Diarization

Continuous speech separation: dataset and analysis

1 code implementation 30 Jan 2020 Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li

In this paper, we define continuous speech separation (CSS) as a task of generating a set of non-overlapped speech signals from a continuous audio stream that contains multiple utterances that are partially overlapped by a varying degree.

Automatic Speech Recognition Speech Separation

Audio-visual Recognition of Overlapped speech for the LRS2 dataset

no code implementations 6 Jan 2020 Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu

Experiments on overlapped speech simulated from the LRS2 dataset suggest the proposed AVSR system outperformed the audio only baseline LF-MMI DNN system by up to 29.98% absolute in word error rate (WER) reduction, and produced recognition performance comparable to a more complex pipelined system.

Ranked #7 on Lipreading on LRS2 (using extra training data)

Audio-Visual Speech Recognition Lipreading +2

Query Auto Completion for Math Formula Search

no code implementations 9 Dec 2019 Shaurya Rohatgi, Wei Zhong, Richard Zanibbi, Jian Wu, C. Lee Giles

Query Auto Completion (QAC) is among the most appealing features of a web search engine.

Practical Two-Step Lookahead Bayesian Optimization

no code implementations NeurIPS 2019 Jian Wu, Peter Frazier

Expected improvement and other acquisition functions widely used in Bayesian optimization use a "one-step" assumption: they value objective function evaluations assuming no future evaluations will be performed.

Method and Dataset Mining in Scientific Papers

no code implementations 29 Nov 2019 Rujing Yao, Linlin Hou, Yingchun Ye, Ou Wu, Ji Zhang, Jian Wu

In the field of machine learning, the involved methods (M) and datasets (D) are key information in papers.

Privileged Features Distillation at Taobao Recommendations

no code implementations 11 Jul 2019 Chen Xu, Quan Li, Junfeng Ge, Jinyang Gao, Xiaoyong Yang, Changhua Pei, Fei Sun, Jian Wu, Hanxiao Sun, Wenwu Ou

To guarantee the consistency of off-line training and on-line serving, we usually utilize the same features that are both available.

Dunhuang Grottoes Painting Dataset and Benchmark

no code implementations 10 Jul 2019 Tianxiu Yu, Shijie Zhang, Cong Lin, ShaoDi You, Jian Wu, Jiawan Zhang, Xiaohong Ding, Huili An

Follow the trend, we release the first public dataset for Dunhuang Grotto Painting restoration.

A comprehensive study of speech separation: spectrogram vs waveform separation

no code implementations 17 May 2019 Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu

We study the speech separation problem for far-field data (more similar to naturalistic audio streams) and develop multi-channel solutions for both frequency and time-domain separators with utilizing spectral, spatial and speaker location information.

Speech Recognition Speech Separation

X2CT-GAN: Reconstructing CT from Biplanar X-Rays with Generative Adversarial Networks

1 code implementation CVPR 2019 Xingde Ying, Heng Guo, Kai Ma, Jian Wu, Zheng-Xin Weng, Yefeng Zheng

Computed tomography (CT) can provide a 3D view of the patient's internal organs, facilitating disease diagnosis, but it incurs more radiation dose to a patient and a CT scanner is much more cost prohibitive than an X-ray machine too.

Computed Tomography (CT)

End-to-End Multi-Channel Speech Separation

no code implementations 15 May 2019 Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lian-Wu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu

This paper extended the previous approach and proposed a new end-to-end model for multi-channel speech separation.

Speech Separation

Personalized Re-ranking for Recommendation

1 code implementation 15 Apr 2019 Changhua Pei, Yi Zhang, Yongfeng Zhang, Fei Sun, Xiao Lin, Hanxiao Sun, Jian Wu, Peng Jiang, Wenwu Ou

Ranking is a core task in recommender systems, which aims at providing an ordered list of items to users.

Recommendation Systems Re-Ranking

BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer

5 code implementations 14 Apr 2019 Fei Sun, Jun Liu, Jian Wu, Changhua Pei, Xiao Lin, Wenwu Ou, Peng Jiang

To address this problem, we train the bidirectional model using the Cloze task, predicting the masked items in the sequence by jointly conditioning on their left and right context.

Sequential Recommendation

Time Domain Audio Visual Speech Separation

no code implementations 7 Apr 2019 Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, Dong Yu

Audio-visual multi-modal modeling has been demonstrated to be effective in many speech related tasks, such as speech recognition and speech enhancement.

Audio and Speech Processing Sound

Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning

no code implementations 12 Mar 2019 Jian Wu, Saul Toscano-Palmerin, Peter I. Frazier, Andrew Gordon Wilson

Nonetheless, for hyperparameter tuning in deep neural networks, the time required to evaluate the validation error for even a few hyperparameter settings remains a bottleneck.

Improving Automatic Source Code Summarization via Deep Reinforcement Learning

2 code implementations 17 Nov 2018 Yao Wan, Zhou Zhao, Min Yang, Guandong Xu, Haochao Ying, Jian Wu, Philip S. Yu

To the best of our knowledge, most state-of-the-art approaches follow an encoder-decoder framework which encodes the code into a hidden space and then decode it into natural language space, suffering from two major drawbacks: a) Their encoders only consider the sequential content of code, ignoring the tree structure which is also critical for the task of code summarization, b) Their decoders are typically trained to predict the next word by maximizing the likelihood of next ground-truth word with previous ground-truth word given.

Code Summarization reinforcement-learning +1

Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training

no code implementations 12 Nov 2018 Yao Wan, Wenqiang Yan, Jianwei Gao, Zhou Zhao, Jian Wu, Philip S. Yu

Dialogue Act (DA) classification is a challenging problem in dialogue interpretation, which aims to attach semantic labels to utterances and characterize the speaker's intention.

Classification Dialogue Act Classification +3

Learned Neural Iterative Decoding for Lossy Image Compression Systems

no code implementations 15 Mar 2018 Alexander G. Ororbia, Ankur Mali, Jian Wu, Scott O'Connell, David Miller, C. Lee Giles

For lossy image compression systems, we develop an algorithm, iterative refinement, to improve the decoder's reconstruction compared to standard decoding techniques.

Image Compression

Continuous-fidelity Bayesian Optimization with Knowledge Gradient

no code implementations ICLR 2018 Jian Wu, Peter I. Frazier

While Bayesian optimization (BO) has achieved great success in optimizing expensive-to-evaluate black-box functions, especially tuning hyperparameters of neural networks, methods such as random search (Li et al., 2016) and multi-fidelity BO (e.g. Klein et al. (2017)) that exploit cheap approximations, e.g. training on a smaller training data or with fewer iterations, can outperform standard BO approaches that use only full-fidelity observations.

A Hierarchical Recurrent Neural Network for Symbolic Melody Generation

2 code implementations 14 Dec 2017 Jian Wu, Changran Hu, Yulong Wang, Xiaolin Hu, Jun Zhu

In this paper, we present a hierarchical recurrent neural network for melody generation, which consists of three Long-Short-Term-Memory (LSTM) subnetworks working in a coarse-to-fine manner along time.

Sound Multimedia

Discretization-free Knowledge Gradient Methods for Bayesian Optimization

no code implementations 20 Jul 2017 Jian Wu, Peter I. Frazier

This paper studies Bayesian ranking and selection (R&S) problems with correlated prior beliefs and continuous domains, i.e. Bayesian optimization (BO).

Bayesian Optimization with Gradients

1 code implementation NeurIPS 2017 Jian Wu, Matthias Poloczek, Andrew Gordon Wilson, Peter I. Frazier

Bayesian optimization has been successful at global optimization of expensive-to-evaluate multimodal objective functions.

The Parallel Knowledge Gradient Method for Batch Bayesian Optimization

2 code implementations NeurIPS 2016 Jian Wu, Peter I. Frazier

In many applications of black-box optimization, one can evaluate multiple points simultaneously, e.g. when evaluating the performances of several different neural network architectures in a parallel computing environment.

Multi-modal Fusion for Diabetes Mellitus and Impaired Glucose Regulation Detection

no code implementations 12 Apr 2016 Jinxing Li, David Zhang, Yongcheng Li, Jian Wu

has proved that tongue, face and sublingual diagnosis as a noninvasive method is a reasonable way for disease detection.
