Search Results for author: Yi Huang

Found 57 papers, 26 papers with code

Orion-14B: Open-source Multilingual Large Language Models

1 code implementation20 Jan 2024 Du Chen, Yi Huang, Xiaopu Li, Yongqiang Li, Yongqiang Liu, Haihui Pan, Leichao Xu, Dacheng Zhang, Zhipeng Zhang, Kun Han

In this study, we introduce Orion-14B, a collection of multilingual large language models with 14 billion parameters.

Scheduling

Diffusion Model-Based Image Editing: A Survey

1 code implementation27 Feb 2024 Yi Huang, Jiancheng Huang, Yifan Liu, Mingfu Yan, Jiaxi Lv, Jianzhuang Liu, Wei Xiong, He Zhang, Shifeng Chen, Liangliang Cao

In this survey, we provide an exhaustive overview of existing methods using diffusion models for image editing, covering both theoretical and practical aspects in the field.

Denoising Image Inpainting +1

Transferable Adversarial Attack based on Integrated Gradients

2 code implementations ICLR 2022 Yi Huang, Adams Wai-Kin Kong

The vulnerability of deep neural networks to adversarial examples has drawn tremendous attention from the community.

Adversarial Attack

UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation

2 code implementations28 Mar 2023 Dmitrii Torbunov, Yi Huang, Huan-Hsin Tseng, Haiwang Yu, Jin Huang, Shinjae Yoo, MeiFeng Lin, Brett Viren, Yihui Ren

An unpaired image-to-image (I2I) translation technique seeks to find a mapping between two domains of data in a fully unsupervised manner.

Image-to-Image Translation Translation

A Challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems

1 code implementation6 Jul 2022 Zhijian Ou, Junlan Feng, Juanzi Li, Yakun Li, Hong Liu, Hao Peng, Yi Huang, Jiangjiang Zhao

A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems, Co-located with EMNLP2022 SereTOD Workshop.

Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset

1 code implementation27 Sep 2022 Hong Liu, Hao Peng, Zhijian Ou, Juanzi Li, Yi Huang, Junlan Feng

Recently, there have merged a class of task-oriented dialogue (TOD) datasets collected through Wizard-of-Oz simulated games.

V4D: Voxel for 4D Novel View Synthesis

1 code implementation28 May 2022 Wanshui Gan, Hongbin Xu, Yi Huang, Shifeng Chen, Naoto Yokoya

The proposed LUTs-based refinement module achieves the performance gain with little computational cost and could serve as the plug-and-play module in the novel view synthesis task.

Novel View Synthesis

AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation

1 code implementation11 May 2022 Xu Cao, Xiaoye Li, Liya Ma, Yi Huang, Xuan Feng, Zening Chen, Hongwu Zeng, Jianguo Cao

We show that AggPose outperforms hybrid model HRFormer and TokenPose in the infant pose estimation dataset.

Keypoint Detection

Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking

1 code implementation17 Nov 2023 Hong Liu, Yucheng Cai, Yuan Zhou, Zhijian Ou, Yi Huang, Junlan Feng

Inspired by the recently emerging prompt tuning method that performs well on dialog systems, we propose to use the prompt pool method, where we maintain a pool of key-value paired prompts and select prompts from the pool according to the distance between the dialog history and the prompt keys.

Continual Learning dialog state tracking

WaveDM: Wavelet-Based Diffusion Models for Image Restoration

1 code implementation23 May 2023 Yi Huang, Jiancheng Huang, Jianzhuang Liu, Mingfu Yan, Yu Dong, Jiaxi Lv, Chaoqi Chen, Shifeng Chen

Latest diffusion-based methods for many image restoration tasks outperform traditional models, but they encounter the long-time inference problem.

Deblurring Denoising +2

Learning to Check Contract Inconsistencies

1 code implementation15 Dec 2020 Shuo Zhang, Junzhou Zhao, Pinghui Wang, Nuo Xu, Yang Yang, Yiting Liu, Yi Huang, Junlan Feng

This will result in the issue of contract inconsistencies, which may severely impair the legal validity of the contract.

Advancing Semi-Supervised Task Oriented Dialog Systems by JSA Learning of Discrete Latent Variable Models

1 code implementation SIGDIAL (ACL) 2022 Yucheng Cai, Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

In this paper, we propose to apply JSA to semi-supervised learning of the latent state TOD models, which is referred to as JSA-TOD.

Variational Latent-State GPT for Semi-Supervised Task-Oriented Dialog Systems

2 code implementations9 Sep 2021 Hong Liu, Yucheng Cai, Zhenru Lin, Zhijian Ou, Yi Huang, Junlan Feng

In this paper, we propose Variational Latent-State GPT model (VLS-GPT), which is the first to combine the strengths of the two approaches.

Building Markovian Generative Architectures over Pretrained LM Backbones for Efficient Task-Oriented Dialog Systems

2 code implementations13 Apr 2022 Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng

Recently, Transformer based pretrained language models (PLMs), such as GPT2 and T5, have been leveraged to build generative task-oriented dialog (TOD) systems.

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems

1 code implementation17 Oct 2022 Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng

Second, an important ingredient in a US is that the user goal can be effectively incorporated and tracked; but how to flexibly integrate goal state tracking and develop an end-to-end trainable US for multi-domains has remained to be a challenge.

Reinforcement Learning (RL)

Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder

1 code implementation9 Nov 2021 Yi Huang, Yihui Ren, Shinjae Yoo, Jin Huang

This method shows advantages both in compression fidelity and ratio compared to traditional data compression methods, such as MGARD, SZ, and ZFP.

Data Compression

"Think Before You Speak": Improving Multi-Action Dialog Policy by Planning Single-Action Dialogs

1 code implementation25 Apr 2022 Shuo Zhang, Junzhou Zhao, Pinghui Wang, Yu Li, Yi Huang, Junlan Feng

Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn, has been widely applied in task-oriented dialog systems to provide expressive and efficient system responses.

Multi-Task Learning

Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation

1 code implementation6 Jun 2022 Pei Ke, Haozhe Ji, Zhenyu Yang, Yi Huang, Junlan Feng, Xiaoyan Zhu, Minlie Huang

Despite the success of text-to-text pre-trained models in various natural language generation (NLG) tasks, the generation performance is largely restricted by the number of labeled data in downstream tasks, particularly in data-to-text generation tasks.

Data-to-Text Generation Unsupervised Pre-training

Knowledge-Retrieval Task-Oriented Dialog Systems with Semi-Supervision

1 code implementation22 May 2023 Yucheng Cai, Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

Most existing task-oriented dialog (TOD) systems track dialog states in terms of slots and values and use them to query a database to get relevant knowledge to generate responses.

Question Answering Retrieval

Fast 2D Bicephalous Convolutional Autoencoder for Compressing 3D Time Projection Chamber Data

1 code implementation23 Oct 2023 Yi Huang, Yihui Ren, Shinjae Yoo, Jin Huang

Developing real-time data compression algorithms to reduce such data at high throughput to fit permanent storage has drawn increasing attention.

Data Compression

Robust open-set classification for encrypted traffic fingerprinting

1 code implementation Elsevier Computer Networks Journal 2023 Thilini Dahanayaka, Yasod Ginige, Yi Huang, Guillaume Jourjon, Suranga Seneviratne

First, we show that a well-regularized deep learning model improves the open-set classification and then we propose a novel open-set classification method with three variants that perform consistently over multiple datasets.

Classification open-set classification +1

A Tamper-Free Semi-Universal Communication System for Deletion Channels

no code implementations9 Apr 2018 Shahab Asoodeh, Yi Huang, Ishanu Chattopadhyay

We investigate the problem of reliable communication between two legitimate parties over deletion channels under an active eavesdropping (aka jamming) adversarial model.

Towards a New Science of a Clinical Data Intelligence

no code implementations17 Nov 2013 Volker Tresp, Sonja Zillner, Maria J. Costa, Yi Huang, Alexander Cavallaro, Peter A. Fasching, Andre Reis, Martin Sedlmayr, Thomas Ganslandt, Klemens Budde, Carl Hinrichs, Danilo Schmidt, Philipp Daumke, Daniel Sonntag, Thomas Wittenberg, Patricia G. Oppelt, Denis Krompass

We argue that a science of a Clinical Data Intelligence is sensible in the context of a Big Data analysis, i. e., with data from many patients and with complete patient information.

Wasserstein Soft Label Propagation on Hypergraphs: Algorithm and Generalization Error Bounds

no code implementations6 Sep 2018 Tingran Gao, Shahab Asoodeh, Yi Huang, James Evans

Inspired by recent interests of developing machine learning and data mining algorithms on hypergraphs, we investigate in this paper the semi-supervised learning algorithm of propagating "soft labels" (e. g. probability distributions, class membership scores) over hypergraphs, by means of optimal transportation.

PAC learning

Perception Evaluation -- A new solar image quality metric based on the multi-fractal property of texture features

1 code implementation24 May 2019 Yi Huang, Peng Jia, Dongmei Cai, Bojun Cai

Next-generation ground-based solar observations require good image quality metrics for post-facto processing techniques.

Solar Image Restoration with the Cycle-GAN Based on Multi-Fractal Properties of Texture Features

no code implementations29 Jul 2019 Peng Jia, Yi Huang, Bojun Cai, Dongmei Cai

Texture is one of the most obvious characteristics in solar images and it is normally described by texture features.

Image Restoration

Data Smashing 2.0: Sequence Likelihood (SL) Divergence For Fast Time Series Comparison

no code implementations26 Sep 2019 Yi Huang, Ishanu Chattopadhyay

Recognizing subtle historical patterns is central to modeling and forecasting problems in time series analysis.

Feature Engineering Time Series +1

Long-range Event-level Prediction and Response Simulation for Urban Crime and Global Terrorism with Granger Networks

no code implementations4 Nov 2019 Timmy Li, Yi Huang, James Evans, Ishanu Chattopadhyay

Large-scale trends in urban crime and global terrorism are well-predicted by socio-economic drivers, but focused, event-level predictions have had limited success.

Time-Guided High-Order Attention Model of Longitudinal Heterogeneous Healthcare Data

no code implementations28 Nov 2019 Yi Huang, Xiaoshan Yang, Changsheng Xu

(1) It can model longitudinal heterogeneous EHRs data via capturing the 3-order correlations of different modalities and the irregular temporal impact of historical events.

Management Mortality Prediction +1

PSF--NET: A Non-parametric Point Spread Function Model for Ground Based Optical Telescopes

no code implementations2 Mar 2020 Peng Jia, Xuebo Wu, Yi Huang, Bojun Cai, Dongmei Cai

Assuming point spread functions induced by the atmospheric turbulence with the same profile belong to the same manifold space, we propose a non-parametric point spread function -- PSF-NET.

Image Restoration

New Threats against Object Detector with Non-local Block

no code implementations ECCV 2020 Yi Huang, Fan Wang, Adams Wai-Kin Kong, Kwok-Yan Lam

The experiments show that the universal patches are able to mislead the detector with greater probabilities.

Object

A Semi-Blind Multiuser SIMO GFDM System in the Presence of CFOs and IQ Imbalances

no code implementations14 Oct 2020 Yujie Liu, Xu Zhu, Eng Gee Lim, Yufei Jiang, Yi Huang

A low-complexity semi-blind joint estimation scheme of multiple channels, CFOs and IQ imbalances is proposed.

Metal-Insulator Transition in $n$-type bulk crystals and films of strongly compensated SrTiO$_3$

no code implementations23 Feb 2021 Yi Huang, Y. Ayino, B. I. Shklovskii

We try to explain this discrepancy in the framework of the theory of the percolation MIT in a strongly compensated semiconductor with the same $N=10^{19}$ cm$^{-3}$.

Materials Science

Counterfactual Matters: Intrinsic Probing For Dialogue State Tracking

no code implementations EANCS 2021 Yi Huang, Junlan Feng, Xiaoting Wu, Xiaoyu Du

Our findings are: the performance variance of generative DSTs is not only due to the model structure itself, but can be attributed to the distribution of cross-domain values.

counterfactual Dialogue State Tracking +1

Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture

no code implementations13 Oct 2022 Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

Recently, there has been progress in supervised funetuning pretrained GPT-2 to build end-to-end task-oriented dialog (TOD) systems.

Temporal-Spatial dependencies ENhanced deep learning model (TSEN) for household leverage series forecasting

no code implementations17 Oct 2022 Hu Yang, Yi Huang, Haijun Wang, Yu Chen

Analyzing both temporal and spatial patterns for an accurate forecasting model for financial time series forecasting is a challenge due to the complex nature of temporal-spatial dynamics: time series from different locations often have distinct patterns; and for the same time series, patterns may vary as time goes by.

Time Series Time Series Forecasting

A holistically 3D-printed flexible millimeter-wave Doppler radar: Towards fully printed high-frequency multilayer flexible hybrid electronics systems

no code implementations24 Feb 2023 Hong Tang, Yingjie Zhang, Bowen Zheng, Sensong An, Mohammad Haerinia, Yunxi Dong, Yi Huang, Wei Guo, Hualiang Zhang

Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology.

Multi-Action Dialog Policy Learning from Logged User Feedback

no code implementations27 Feb 2023 Shuo Zhang, Junzhou Zhao, Pinghui Wang, Tianxiang Wang, Zi Liang, Jing Tao, Yi Huang, Junlan Feng

To cope with this problem, we explore to improve multi-action dialog policy learning with explicit and implicit turn-level user feedback received for historical predictions (i. e., logged user feedback) that are cost-efficient to collect and faithful to real-world scenarios.

Low-Light Image Enhancement by Learning Contrastive Representations in Spatial and Frequency Domains

no code implementations23 Mar 2023 Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng

Images taken under low-light conditions tend to suffer from poor visibility, which can decrease image quality and even reduce the performance of the downstream tasks.

Contrastive Learning Low-Light Image Enhancement

Integrating Different Informations for Portfolio Selection

no code implementations29 May 2023 Yi Huang, Wei Zhu, Duan Li, Shushang Zhu, Shikun Wang

Following the idea of Bayesian learning via Gaussian mixture model, we organically combine the backward-looking information contained in the historical data and the forward-looking information implied by the market portfolio, which is affected by heterogeneous expectations and noisy trading behavior.

Designing a Deep Learning-Driven Resource-Efficient Diagnostic System for Metastatic Breast Cancer: Reducing Long Delays of Clinical Diagnosis and Improving Patient Survival in Developing Countries

no code implementations4 Aug 2023 William Gao, Dayong Wang, Yi Huang

This research provides an innovative technological solution to address the long delays in metastatic breast cancer diagnosis and the consequent disparity in patient survival outcome in developing countries.

Computational Efficiency

Graph Edit Distance Learning via Different Attention

no code implementations26 Aug 2023 Jiaxi Lv, Liang Zhang, Yi Huang, Jiancheng Huang, Shifeng Chen

To this end, DiffAtt uses the difference between two graph-level embeddings as an attentional mechanism to capture the graph structural difference of the two graphs.

Graph Similarity

BEVHeight++: Toward Robust Visual Centric 3D Object Detection

no code implementations28 Sep 2023 Lei Yang, Tao Tang, Jun Li, Peng Chen, Kun Yuan, Li Wang, Yi Huang, Xinyu Zhang, Kaicheng Yu

In essence, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

3D Object Detection Autonomous Driving +2

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations30 Sep 2023 Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

Seal2Real: Prompt Prior Learning on Diffusion Model for Unsupervised Document Seal Data Generation and Realisation

no code implementations1 Oct 2023 Jiancheng Huang, Yifan Liu, Yi Huang, Shifeng Chen

To address the lack of labelled datasets for these seal-related tasks, we propose Seal2Real, a generative method that generates a large amount of labelled document seal data, and construct a Seal-DB dataset containing 20K images with labels.

GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

no code implementations21 Nov 2023 Jiaxi Lv, Yi Huang, Mingfu Yan, Jiancheng Huang, Jianzhuang Liu, Yifan Liu, Yafei Wen, Xiaoxin Chen, Shifeng Chen

To tackle these issues, we propose GPT4Motion, a training-free framework that leverages the planning capability of large language models such as GPT, the physical simulation strength of Blender, and the excellent image generation ability of text-to-image diffusion models to enhance the quality of video synthesis.

Image Generation Text-to-Video Generation +1

USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis

no code implementations30 Dec 2023 Jing Jiao, Jin Zhou, Xiaokang Li, Menghua Xia, Yi Huang, Lihong Huang, Na Wang, Xiaofan Zhang, Shichong Zhou, Yuanyuan Wang, Yi Guo

In this paper, we present a universal US foundation model, named USFM, generalized to diverse tasks and organs towards label efficient US image analysis.

Image Enhancement

Cannot find the paper you are looking for? You can Submit a new open access paper.