Search Results for author: Yi Huang

Found 57 papers, 26 papers with code

Orion-14B: Open-source Multilingual Large Language Models

1 code implementation • 20 Jan 2024 • Du Chen, Yi Huang, Xiaopu Li, Yongqiang Li, Yongqiang Liu, Haihui Pan, Leichao Xu, Dacheng Zhang, Zhipeng Zhang, Kun Han

In this study, we introduce Orion-14B, a collection of multilingual large language models with 14 billion parameters.

Scheduling

748

Diffusion Model-Based Image Editing: A Survey

1 code implementation • 27 Feb 2024 • Yi Huang, Jiancheng Huang, Yifan Liu, Mingfu Yan, Jiaxi Lv, Jianzhuang Liu, Wei Xiong, He Zhang, Shifeng Chen, Liangliang Cao

In this survey, we provide an exhaustive overview of existing methods using diffusion models for image editing, covering both theoretical and practical aspects in the field.

Denoising • Image Inpainting +1

232

Transferable Adversarial Attack based on Integrated Gradients

2 code implementations • ICLR 2022 • Yi Huang, Adams Wai-Kin Kong

The vulnerability of deep neural networks to adversarial examples has drawn tremendous attention from the community.

Adversarial Attack

127

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution

2 code implementations • EMNLP 2021 • Yi Huang, Buse Giledereli, Abdullatif Köksal, Arzucan Özgür, Elif Ozkirimli

Here, we introduce the application of balancing loss functions for multi-label text classification.

Ranked #1 on Multi-Label Text Classification on Reuters-21578

Document Classification • Multi-Label Text Classification

114

UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation

2 code implementations • 4 Mar 2022 • Dmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, MeiFeng Lin, Brett Viren, Yihui Ren

Unpaired image-to-image translation has broad applications in art, design, and scientific simulations.

Image-to-Image Translation • Translation

111

UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation

2 code implementations • 28 Mar 2023 • Dmitrii Torbunov, Yi Huang, Huan-Hsin Tseng, Haiwang Yu, Jin Huang, Shinjae Yoo, MeiFeng Lin, Brett Viren, Yihui Ren

An unpaired image-to-image (I2I) translation technique seeks to find a mapping between two domains of data in a fully unsupervised manner.

Image-to-Image Translation • Translation

102

A Challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems

1 code implementation • 6 Jul 2022 • Zhijian Ou, Junlan Feng, Juanzi Li, Yakun Li, Hong Liu, Hao Peng, Yi Huang, Jiangjiang Zhao

A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems, co-located with the EMNLP 2022 SereTOD Workshop.

Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset

1 code implementation • 27 Sep 2022 • Hong Liu, Hao Peng, Zhijian Ou, Juanzi Li, Yi Huang, Junlan Feng

Recently, there has emerged a class of task-oriented dialogue (TOD) datasets collected through Wizard-of-Oz simulated games.

V4D: Voxel for 4D Novel View Synthesis

1 code implementation • 28 May 2022 • Wanshui Gan, Hongbin Xu, Yi Huang, Shifeng Chen, Naoto Yokoya

The proposed LUTs-based refinement module achieves a performance gain at little computational cost and could serve as a plug-and-play module in the novel view synthesis task.

Novel View Synthesis

AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation

1 code implementation • 11 May 2022 • Xu Cao, Xiaoye Li, Liya Ma, Yi Huang, Xuan Feng, Zening Chen, Hongwu Zeng, Jianguo Cao

We show that AggPose outperforms the hybrid models HRFormer and TokenPose on the infant pose estimation dataset.

Ranked #7 on Keypoint Detection on MS COCO

Keypoint Detection

Unpaired Image Translation to Mitigate Domain Shift in Liquid Argon Time Projection Chamber Detector Responses

1 code implementation • 25 Apr 2023 • Yi Huang, Dmitrii Torbunov, Brett Viren, Haiwang Yu, Jin Huang, MeiFeng Lin, Yihui Ren

This issue is prevalent in many scientific domains where algorithms are trained on simulated data but applied to real-world datasets.

Domain Adaptation • Image-to-Image Translation +1

Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking

1 code implementation • 17 Nov 2023 • Hong Liu, Yucheng Cai, Yuan Zhou, Zhijian Ou, Yi Huang, Junlan Feng

Inspired by the recently emerging prompt tuning method that performs well on dialog systems, we propose to use the prompt pool method, where we maintain a pool of key-value paired prompts and select prompts from the pool according to the distance between the dialog history and the prompt keys.

Continual Learning • dialog state tracking

WaveDM: Wavelet-Based Diffusion Models for Image Restoration

1 code implementation • 23 May 2023 • Yi Huang, Jiancheng Huang, Jianzhuang Liu, Mingfu Yan, Yu Dong, Jiaxi Lv, Chaoqi Chen, Shifeng Chen

The latest diffusion-based methods for many image restoration tasks outperform traditional models, but they suffer from long inference times.

Deblurring • Denoising +2

Learning to Check Contract Inconsistencies

1 code implementation • 15 Dec 2020 • Shuo Zhang, Junzhou Zhao, Pinghui Wang, Nuo Xu, Yang Yang, Yiting Liu, Yi Huang, Junlan Feng

This will result in the issue of contract inconsistencies, which may severely impair the legal validity of the contract.

Generalized Intent Discovery: Learning from Open World Dialogue System

1 code implementation • COLING 2022 • Yutao Mou, Keqing He, Yanan Wu, Pei Wang, Jingang Wang, Wei Wu, Yi Huang, Junlan Feng, Weiran Xu

Traditional intent classification models are based on a pre-defined intent set and only recognize limited in-domain (IND) intent classes.

intent-classification • Intent Classification +1

Advancing Semi-Supervised Task Oriented Dialog Systems by JSA Learning of Discrete Latent Variable Models

1 code implementation • SIGDIAL (ACL) 2022 • Yucheng Cai, Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

In this paper, we propose to apply JSA to semi-supervised learning of the latent state TOD models, which is referred to as JSA-TOD.

Variational Latent-State GPT for Semi-Supervised Task-Oriented Dialog Systems

2 code implementations • 9 Sep 2021 • Hong Liu, Yucheng Cai, Zhenru Lin, Zhijian Ou, Yi Huang, Junlan Feng

In this paper, we propose the Variational Latent-State GPT model (VLS-GPT), which is the first to combine the strengths of the two approaches.

Building Markovian Generative Architectures over Pretrained LM Backbones for Efficient Task-Oriented Dialog Systems

2 code implementations • 13 Apr 2022 • Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng

Recently, Transformer based pretrained language models (PLMs), such as GPT2 and T5, have been leveraged to build generative task-oriented dialog (TOD) systems.

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems

1 code implementation • 17 Oct 2022 • Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng

Second, an important ingredient in a user simulator (US) is that the user goal can be effectively incorporated and tracked, but how to flexibly integrate goal state tracking and develop an end-to-end trainable US for multiple domains has remained a challenge.

Reinforcement Learning (RL)

Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder

1 code implementation • 9 Nov 2021 • Yi Huang, Yihui Ren, Shinjae Yoo, Jin Huang

This method shows advantages both in compression fidelity and ratio compared to traditional data compression methods, such as MGARD, SZ, and ZFP.

Data Compression

"Think Before You Speak": Improving Multi-Action Dialog Policy by Planning Single-Action Dialogs

1 code implementation • 25 Apr 2022 • Shuo Zhang, Junzhou Zhao, Pinghui Wang, Yu Li, Yi Huang, Junlan Feng

Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn, has been widely applied in task-oriented dialog systems to provide expressive and efficient system responses.

Multi-Task Learning

Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation

1 code implementation • 6 Jun 2022 • Pei Ke, Haozhe Ji, Zhenyu Yang, Yi Huang, Junlan Feng, Xiaoyan Zhu, Minlie Huang

Despite the success of text-to-text pre-trained models in various natural language generation (NLG) tasks, the generation performance is largely restricted by the number of labeled data in downstream tasks, particularly in data-to-text generation tasks.

Data-to-Text Generation • Unsupervised Pre-training

Knowledge-Retrieval Task-Oriented Dialog Systems with Semi-Supervision

1 code implementation • 22 May 2023 • Yucheng Cai, Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

Most existing task-oriented dialog (TOD) systems track dialog states in terms of slots and values and use them to query a database to get relevant knowledge to generate responses.

Question Answering • Retrieval

Fast 2D Bicephalous Convolutional Autoencoder for Compressing 3D Time Projection Chamber Data

1 code implementation • 23 Oct 2023 • Yi Huang, Yihui Ren, Shinjae Yoo, Jin Huang

Developing real-time data compression algorithms to reduce such data at high throughput to fit permanent storage has drawn increasing attention.

Data Compression

Robust open-set classification for encrypted traffic fingerprinting

1 code implementation • Elsevier Computer Networks Journal 2023 • Thilini Dahanayaka, Yasod Ginige, Yi Huang, Guillaume Jourjon, Suranga Seneviratne

First, we show that a well-regularized deep learning model improves the open-set classification and then we propose a novel open-set classification method with three variants that perform consistently over multiple datasets.

Classification • open-set classification +1

A Tamper-Free Semi-Universal Communication System for Deletion Channels

no code implementations • 9 Apr 2018 • Shahab Asoodeh, Yi Huang, Ishanu Chattopadhyay

We investigate the problem of reliable communication between two legitimate parties over deletion channels under an active eavesdropping (aka jamming) adversarial model.

Towards a New Science of a Clinical Data Intelligence

no code implementations • 17 Nov 2013 • Volker Tresp, Sonja Zillner, Maria J. Costa, Yi Huang, Alexander Cavallaro, Peter A. Fasching, Andre Reis, Martin Sedlmayr, Thomas Ganslandt, Klemens Budde, Carl Hinrichs, Danilo Schmidt, Philipp Daumke, Daniel Sonntag, Thomas Wittenberg, Patricia G. Oppelt, Denis Krompass

We argue that a science of a Clinical Data Intelligence is sensible in the context of a Big Data analysis, i.e., with data from many patients and with complete patient information.

Wasserstein Soft Label Propagation on Hypergraphs: Algorithm and Generalization Error Bounds

no code implementations • 6 Sep 2018 • Tingran Gao, Shahab Asoodeh, Yi Huang, James Evans

Inspired by recent interest in developing machine learning and data mining algorithms on hypergraphs, we investigate in this paper the semi-supervised learning algorithm of propagating "soft labels" (e.g., probability distributions, class membership scores) over hypergraphs, by means of optimal transportation.

PAC learning

Perception Evaluation -- A new solar image quality metric based on the multi-fractal property of texture features

1 code implementation • 24 May 2019 • Yi Huang, Peng Jia, Dongmei Cai, Bojun Cai

Next-generation ground-based solar observations require good image quality metrics for post-facto processing techniques.

Solar Image Restoration with the Cycle-GAN Based on Multi-Fractal Properties of Texture Features

no code implementations • 29 Jul 2019 • Peng Jia, Yi Huang, Bojun Cai, Dongmei Cai

Texture is one of the most obvious characteristics in solar images and it is normally described by texture features.

Image Restoration

Data Smashing 2.0: Sequence Likelihood (SL) Divergence For Fast Time Series Comparison

no code implementations • 26 Sep 2019 • Yi Huang, Ishanu Chattopadhyay

Recognizing subtle historical patterns is central to modeling and forecasting problems in time series analysis.

Feature Engineering • Time Series +1

Long-range Event-level Prediction and Response Simulation for Urban Crime and Global Terrorism with Granger Networks

no code implementations • 4 Nov 2019 • Timmy Li, Yi Huang, James Evans, Ishanu Chattopadhyay

Large-scale trends in urban crime and global terrorism are well-predicted by socio-economic drivers, but focused, event-level predictions have had limited success.

Time-Guided High-Order Attention Model of Longitudinal Heterogeneous Healthcare Data

no code implementations • 28 Nov 2019 • Yi Huang, Xiaoshan Yang, Changsheng Xu

(1) It can model longitudinal heterogeneous EHRs data via capturing the 3-order correlations of different modalities and the irregular temporal impact of historical events.

Management • Mortality Prediction +1

PSF--NET: A Non-parametric Point Spread Function Model for Ground Based Optical Telescopes

no code implementations • 2 Mar 2020 • Peng Jia, Xuebo Wu, Yi Huang, Bojun Cai, Dongmei Cai

Assuming point spread functions induced by the atmospheric turbulence with the same profile belong to the same manifold space, we propose a non-parametric point spread function -- PSF-NET.

Image Restoration

Meta-Reinforced Multi-Domain State Generator for Dialogue Systems

no code implementations • ACL 2020 • Yi Huang, Junlan Feng, Min Hu, Xiaoting Wu, Xiaoyu Du, Shuo Ma

The state-of-the-art accuracy for DST is below 50% for a multi-domain dialogue task.

Domain Adaptation Meta-Learning +1

New Threats against Object Detector with Non-local Block

no code implementations • ECCV 2020 • Yi Huang, Fan Wang, Adams Wai-Kin Kong, Kwok-Yan Lam

The experiments show that the universal patches are able to mislead the detector with greater probabilities.

Object

A Semi-Blind Multiuser SIMO GFDM System in the Presence of CFOs and IQ Imbalances

no code implementations • 14 Oct 2020 • Yujie Liu, Xu Zhu, Eng Gee Lim, Yufei Jiang, Yi Huang

A low-complexity semi-blind joint estimation scheme of multiple channels, CFOs and IQ imbalances is proposed.

Metal-Insulator Transition in $n$-type bulk crystals and films of strongly compensated SrTiO$_3$

no code implementations • 23 Feb 2021 • Yi Huang, Y. Ayino, B. I. Shklovskii

We try to explain this discrepancy in the framework of the theory of the percolation MIT in a strongly compensated semiconductor with the same $N=10^{19}$ cm$^{-3}$.

Materials Science

Towards Low-Resource Semi-Supervised Dialogue Generation with Meta-Learning

no code implementations • Findings of the Association for Computational Linguistics 2020 • Yi Huang, Junlan Feng, Shuo Ma, Xiaoyu Du, Xiaoting Wu

In this paper, we propose a meta-learning based semi-supervised explicit dialogue state tracker (SEDST) for neural dialogue generation, denoted as MEDST.

Dialogue Generation • Meta-Learning

Counterfactual Matters: Intrinsic Probing For Dialogue State Tracking

no code implementations • EANCS 2021 • Yi Huang, Junlan Feng, Xiaoting Wu, Xiaoyu Du

Our findings are: the performance variance of generative DSTs is not only due to the model structure itself, but can be attributed to the distribution of cross-domain values.

counterfactual • Dialogue State Tracking +1

Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models

no code implementations • 12 Nov 2021 • Igor Kulev, Berkay Köprü, Raul Rodriguez-Esteban, Diego Saldana, Yi Huang, Alessandro La Torraca, Elif Ozkirimli

The BioCreative VII Track 3 challenge focused on the identification of medication names in Twitter user timelines.

Data Augmentation

PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling

no code implementations • COLING 2022 • Guanting Dong, Daichi Guo, LiWen Wang, Xuefeng Li, Zechen Wang, Chen Zeng, Keqing He, Jinzheng Zhao, Hao Lei, Xinyue Cui, Yi Huang, Junlan Feng, Weiran Xu

Most existing slot filling models tend to memorize inherent patterns of entities and corresponding contexts from training data.

slot-filling • Slot Filling

Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture

no code implementations • 13 Oct 2022 • Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

Recently, there has been progress in supervised fine-tuning of pretrained GPT-2 to build end-to-end task-oriented dialog (TOD) systems.

Temporal-Spatial dependencies ENhanced deep learning model (TSEN) for household leverage series forecasting

no code implementations • 17 Oct 2022 • Hu Yang, Yi Huang, Haijun Wang, Yu Chen

Analyzing both temporal and spatial patterns for an accurate forecasting model for financial time series forecasting is a challenge due to the complex nature of temporal-spatial dynamics: time series from different locations often have distinct patterns; and for the same time series, patterns may vary as time goes by.

Time Series • Time Series Forecasting

A holistically 3D-printed flexible millimeter-wave Doppler radar: Towards fully printed high-frequency multilayer flexible hybrid electronics systems

no code implementations • 24 Feb 2023 • Hong Tang, Yingjie Zhang, Bowen Zheng, Sensong An, Mohammad Haerinia, Yunxi Dong, Yi Huang, Wei Guo, Hualiang Zhang

Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology.

Multi-Action Dialog Policy Learning from Logged User Feedback

no code implementations • 27 Feb 2023 • Shuo Zhang, Junzhou Zhao, Pinghui Wang, Tianxiang Wang, Zi Liang, Jing Tao, Yi Huang, Junlan Feng

To cope with this problem, we explore improving multi-action dialog policy learning with explicit and implicit turn-level user feedback received for historical predictions (i.e., logged user feedback) that is cost-efficient to collect and faithful to real-world scenarios.

Low-Light Image Enhancement by Learning Contrastive Representations in Spatial and Frequency Domains

no code implementations • 23 Mar 2023 • Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng

Images taken under low-light conditions tend to suffer from poor visibility, which can decrease image quality and even reduce the performance of the downstream tasks.

Contrastive Learning • Low-Light Image Enhancement

Learning Higher-order Object Interactions for Keypoint-based Video Understanding

no code implementations • 16 May 2023 • Yi Huang, Asim Kadav, Farley Lai, Deep Patel, Hans Peter Graf

Specifically, KeyNet introduces the use of object based keypoint information to capture context in the scene.

Action Localization • Action Recognition +2

Integrating Different Informations for Portfolio Selection

no code implementations • 29 May 2023 • Yi Huang, Wei Zhu, Duan Li, Shushang Zhu, Shikun Wang

Following the idea of Bayesian learning via Gaussian mixture model, we organically combine the backward-looking information contained in the historical data and the forward-looking information implied by the market portfolio, which is affected by heterogeneous expectations and noisy trading behavior.

Designing a Deep Learning-Driven Resource-Efficient Diagnostic System for Metastatic Breast Cancer: Reducing Long Delays of Clinical Diagnosis and Improving Patient Survival in Developing Countries

no code implementations • 4 Aug 2023 • William Gao, Dayong Wang, Yi Huang

This research provides an innovative technological solution to address the long delays in metastatic breast cancer diagnosis and the consequent disparity in patient survival outcome in developing countries.

Computational Efficiency

Graph Edit Distance Learning via Different Attention

no code implementations • 26 Aug 2023 • Jiaxi Lv, Liang Zhang, Yi Huang, Jiancheng Huang, Shifeng Chen

To this end, DiffAtt uses the difference between two graph-level embeddings as an attentional mechanism to capture the graph structural difference of the two graphs.

Graph Similarity

BEVHeight++: Toward Robust Visual Centric 3D Object Detection

no code implementations • 28 Sep 2023 • Lei Yang, Tao Tang, Jun Li, Peng Chen, Kun Yuan, Li Wang, Yi Huang, Xinyu Zhang, Kaicheng Yu

In essence, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

3D Object Detection • Autonomous Driving +2

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations • 30 Sep 2023 • Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving • Monocular 3D Object Detection +1

Seal2Real: Prompt Prior Learning on Diffusion Model for Unsupervised Document Seal Data Generation and Realisation

no code implementations • 1 Oct 2023 • Jiancheng Huang, Yifan Liu, Yi Huang, Shifeng Chen

To address the lack of labelled datasets for these seal-related tasks, we propose Seal2Real, a generative method that generates a large amount of labelled document seal data, and construct a Seal-DB dataset containing 20K images with labels.

Chemist-X: Large Language Model-empowered Agent for Reaction Condition Recommendation in Chemical Synthesis

no code implementations • 16 Nov 2023 • Kexin Chen, Junyou Li, Kunyi Wang, Yuyang Du, Jiahui Yu, Jiamin Lu, Lanqing Li, Jiezhong Qiu, Jianzhang Pan, Yi Huang, Qun Fang, Pheng Ann Heng, Guangyong Chen

Recent AI research plots a promising future of automatic chemical reactions within the chemistry society.

Contrastive Learning • Drug Discovery +3

GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

no code implementations • 21 Nov 2023 • Jiaxi Lv, Yi Huang, Mingfu Yan, Jiancheng Huang, Jianzhuang Liu, Yifan Liu, Yafei Wen, Xiaoxin Chen, Shifeng Chen

To tackle these issues, we propose GPT4Motion, a training-free framework that leverages the planning capability of large language models such as GPT, the physical simulation strength of Blender, and the excellent image generation ability of text-to-image diffusion models to enhance the quality of video synthesis.

Image Generation • Text-to-Video Generation +1

USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis

no code implementations • 30 Dec 2023 • Jing Jiao, Jin Zhou, Xiaokang Li, Menghua Xia, Yi Huang, Lihong Huang, Na Wang, Xiaofan Zhang, Shichong Zhou, Yuanyuan Wang, Yi Guo

In this paper, we present a universal US foundation model, named USFM, generalized to diverse tasks and organs towards label efficient US image analysis.

Image Enhancement
