no code implementations • SemEval (NAACL) 2022 • Ye Wang, Yanmeng Wang, Baishun Ling, Zexiang Liao, Shaojun Wang, Jing Xiao
This paper describes the second-placed system for subtask 2 and the ninth-placed system for subtask 1 in SemEval 2022 Task 4: Patronizing and Condescending Language Detection.
no code implementations • 15 Apr 2025 • Yu Gao, Lixue Gong, Qiushan Guo, Xiaoxia Hou, Zhichao Lai, Fanshi Li, Liang Li, Xiaochen Lian, Chao Liao, Liyang Liu, Wei Liu, Yichun Shi, Shiqi Sun, Yu Tian, Zhi Tian, Peng Wang, Rui Wang, Xuanda Wang, Xun Wang, Ye Wang, Guofeng Wu, Jie Wu, Xin Xia, Xuefeng Xiao, Zhonghua Zhai, Xinyu Zhang, Qi Zhang, Yuwei Zhang, Shijia Zhao, Jianchao Yang, Weilin Huang
At the data stratum, we double the dataset using a defect-aware training paradigm and a dual-axis collaborative data-sampling framework.
1 code implementation • 17 Mar 2025 • Ye Wang, Boshen Xu, Zihao Yue, Zihan Xiao, Ziheng Wang, Liang Zhang, Dingyi Yang, Wenxuan Wang, Qin Jin
We introduce TimeZero, a reasoning-guided LVLM designed for the temporal video grounding (TVG) task.
1 code implementation • 10 Mar 2025 • Lixue Gong, Xiaoxia Hou, Fanshi Li, Liang Li, Xiaochen Lian, Fei Liu, Liyang Liu, Wei Liu, Wei Lu, Yichun Shi, Shiqi Sun, Yu Tian, Zhi Tian, Peng Wang, Xun Wang, Ye Wang, Guofeng Wu, Jie Wu, Xin Xia, Xuefeng Xiao, Linjie Yang, Zhonghua Zhai, Xinyu Zhang, Qi Zhang, Yuwei Zhang, Shijia Zhao, Jianchao Yang, Weilin Huang
To address these limitations, we present Seedream 2. 0, a native Chinese-English bilingual image generation foundation model that excels across diverse dimensions, which adeptly manages text prompt in both Chinese and English, supporting bilingual image generation and text rendering.
no code implementations • 26 Feb 2025 • Ashley Lewis, Michael White, Jing Liu, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang
Using a dataset of questions about a Samsung Smart TV user manual, we demonstrate that synthetic data generated by LLMs outperforms crowdsourced data in reducing hallucination in finetuned models.
no code implementations • 20 Feb 2025 • Minjie Hong, Yan Xia, Zehan Wang, Jieming Zhu, Ye Wang, Sihang Cai, Xiaoda Yang, Quanyu Dai, Zhenhua Dong, Zhimeng Zhang, Zhou Zhao
Large language models (LLMs) are increasingly leveraged as foundational backbones in the development of advanced recommender systems, offering enhanced capabilities through their extensive knowledge and reasoning.
no code implementations • 27 Jan 2025 • Ryo Hase, Md Rafi Ur Rashid, Ashley Lewis, Jing Liu, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang
Improving the safety and reliability of large language models (LLMs) is a crucial aspect of realizing trustworthy AI systems.
1 code implementation • 24 Jan 2025 • Weicai Yan, Ye Wang, Wang Lin, Zirun Guo, Zhou Zhao, Tao Jin
Considering that the training parameters scale to the number of layers and tasks, we propose low-rank interaction-augmented decomposition to avoid memory explosion while enhancing the cross-modal association through sharing and separating common-specific low-rank factors.
no code implementations • 22 Dec 2024 • Zhigen Li, Yanmeng Wang, Rizhao Fan, Ye Wang, Jianfeng Li, Shaojun Wang
LAPA has three-stage training on three types of related resources to solve this problem: 1. pre-training PLMs on unsupervised corpora, 2. inserting an adapter layer and meta-training on source domain labeled data, and 3. fine-tuning adapters on a small amount of target domain labeled data.
no code implementations • 17 Dec 2024 • Xinlong Cheng, Tiantian Cao, Guoan Cheng, BangXuan Huang, Xinghan Tian, Ye Wang, Xiaoyu He, Weixin Li, Tianfan Xue, Xuan Dong
In this work, we address the limitations of denoising diffusion models (DDMs) in image restoration tasks, particularly the shape and color distortions that can compromise image quality.
no code implementations • 17 Dec 2024 • Qinyu Zhang, Liang Xu, Jianhao Huang, Tao Yang, Jian Jiao, Ye Wang, Yao Shi, Chiya Zhang, Xingjian Zhang, Ke Zhang, Yupeng Gong, Na Deng, Nan Zhao, Zhen Gao, Shujun Han, Xiaodong Xu, Li You, Dongming Wang, Shan Jiang, Dixian Zhao, Nan Zhang, Liujun Hu, Xiongwen He, Yonghui Li, Xiqi Gao, Xiaohu You
In this context, the distributed satellite information networks (DSIN), exemplified by the cohesive clustered satellites system, have emerged as an innovative architecture, bridging information gaps across diverse satellite systems, such as communication, navigation, and remote sensing, and establishing a unified, open information network paradigm to support resilient space information services.
no code implementations • 7 Dec 2024 • Ye Wang, Yaxiong Wang, Guoshuai Zhao, Xueming Qian
Continuous Generalized Category Discovery (C-GCD) aims to continually discover novel classes from unlabelled image sets while maintaining performance on old classes.
no code implementations • 6 Nov 2024 • Ruhan Wang, Ye Wang, Jing Liu, Toshiaki Koike-Akino
Modern quantum machine learning (QML) methods involve the variational optimization of parameterized quantum circuits on training datasets, followed by predictions on testing datasets.
1 code implementation • 14 Oct 2024 • Hongfu Liu, Hengguan Huang, Xiangming Gu, Hao Wang, Ye Wang
Large language models (LLMs) pose significant risks due to the potential for generating harmful content or users attempting to evade guardrails.
1 code implementation • 14 Oct 2024 • Xiangming Gu, Tianyu Pang, Chao Du, Qian Liu, Fengzhuo Zhang, Cunxiao Du, Ye Wang, Min Lin
In this work, we first demonstrate that attention sinks exist universally in LMs with various inputs, even in small models.
no code implementations • 4 Oct 2024 • Ye Wang, Sipeng Zheng, Bin Cao, Qianshan Wei, Qin Jin, Zongqing Lu
Inspired by the recent success of LLMs, the field of human motion understanding has increasingly shifted towards the development of large motion models.
no code implementations • 25 Sep 2024 • Peicong Zheng, Xuantao Lyu, Ye Wang, Yi Gong
The CDL formulation is cast as a bilevel optimization problem, which we solve using a gradient-based approach.
no code implementations • 18 Sep 2024 • Ye Wang, Yaxiong Wang, Guoshuai Zhao, Xueming Qian
Few-shot class-incremental learning (FSCIL) aims to incrementally recognize new classes using a few samples while maintaining the performance on previously learned classes.
class-incremental learning
Few-Shot Class-Incremental Learning
+1
no code implementations • 11 Sep 2024 • Zhuohang Li, Andrew Lowy, Jing Liu, Toshiaki Koike-Akino, Bradley Malin, Kieran Parsons, Ye Wang
We explore user-level gradient inversion as a new attack surface in distributed learning.
no code implementations • 30 Aug 2024 • Md Rafi Ur Rashid, Jing Liu, Toshiaki Koike-Akino, Shagufta Mehnaz, Ye Wang
This approach manipulates a pre-trained language model to increase the leakage of private data during the fine-tuning process.
no code implementations • 29 Aug 2024 • Zhuohang Li, Andrew Lowy, Jing Liu, Toshiaki Koike-Akino, Kieran Parsons, Bradley Malin, Ye Wang
While previous work has studied various privacy risks of sharing gradients, our paper aims to provide a systematic approach to analyze private information leakage from gradients.
1 code implementation • 27 Aug 2024 • Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh
Further analysis on cross-model transfer indicates the pivotal role of first target token optimization in leveraging suffix transferability for efficient searching.
no code implementations • 27 Aug 2024 • Longshen Ou, Jingwei Zhao, Ziyu Wang, Gus Xia, Ye Wang
Automatic music arrangement streamlines the creation of musical variants for composers and arrangers, reducing reliance on extensive music expertise.
1 code implementation • 19 Jul 2024 • Yihang Fu, Mingwei Jing, Jiaolun Zhou, Peilin Wu, Ye Wang, Luyao Zhang, Chuang Hu
Blockchain technology is essential for the digital economy and metaverse, supporting applications from decentralized finance to virtual assets.
no code implementations • 16 Jul 2024 • Ryo Hase, Ye Wang, Toshiaki Koike-Akino, Jing Liu, Kieran Parsons
Randomized smoothing is a defensive technique to achieve enhanced robustness against adversarial examples which are small input perturbations that degrade the performance of neural network models.
no code implementations • 15 Jul 2024 • Keshav Bimbraw, Ye Wang, Jing Liu, Toshiaki Koike-Akino
Large vision-language models (LVLMs), such as the Generative Pre-trained Transformer 4-omni (GPT-4o), are emerging multi-modal foundation models which have great potential as powerful artificial-intelligence (AI) assistance tools for a myriad of applications, including healthcare, industrial, and academic sectors.
no code implementations • 15 Jul 2024 • Keshav Bimbraw, Jing Liu, Ye Wang, Toshiaki Koike-Akino
Notably, the proposed method is also robust to an increase in the number of missing channels compared to other methods.
1 code implementation • 12 Jul 2024 • Naiyao Wang, Tongbang Jiang, Ye Wang, Shaoyang Qiu, Bo Zhang, Xinqiang Xie, Munan Li, Chunliu Wang, Yiyang Wang, Hongxiang Ren, Ruili Wang, Hongjun Shan, Hongbo Liu
Intelligent maritime, as an essential component of smart ocean construction, deeply integrates advanced artificial intelligence technology and data analysis methods, which covers multiple aspects such as smart vessels, route optimization, safe navigation, aiming to enhance the efficiency of ocean resource utilization and the intelligence of transportation networks.
1 code implementation • 4 Jul 2024 • Zhigen Li, Jianxiang Peng, Yanmeng Wang, Yong Cao, Tianhao Shen, Minghui Zhang, Linxi Su, Shang Wu, Yihang Wu, Yuqian Wang, Ye Wang, Wei Hu, Jianfeng Li, Shaojun Wang, Jing Xiao, Deyi Xiong
Conversational agents powered by Large Language Models (LLMs) show superior performance in various tasks.
no code implementations • 24 Jun 2024 • Yuting Mei, Ye Wang, Sipeng Zheng, Qin Jin
As robotic agents increasingly assist humans in reality, quadruped robots offer unique opportunities for interaction in complex scenarios due to their agile movement.
1 code implementation • 22 Jun 2024 • Hao Wang, Ye Wang, Xiangyu Yang
We prove the global convergence of the proposed algorithm, guaranteeing that every limit point of the iterates is a critical point.
1 code implementation • 20 Jun 2024 • Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong
Generative retrieval has recently emerged as a promising approach to sequential recommendation, framing candidate item retrieval as an autoregressive sequence generation problem.
no code implementations • 13 Jun 2024 • Feng Xu, Yiming Wan, Ye Wang, Vicenc Puig
This paper proposes a joint gain and input design method for observer-based asymptotic active fault diagnosis, which is based on a newly-defined notion named the excluding degree of the origin from a zonotope.
no code implementations • 7 Jun 2024 • Jing Liu, Andrew Lowy, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang
The recent developments of Diffusion Models (DMs) enable generation of astonishingly high-quality synthetic samples.
no code implementations • 7 Jun 2024 • Xintong Wang, Mingqian Shi, Ye Wang
Mispronunciation Detection and Diagnosis (MDD) systems, leveraging Automatic Speech Recognition (ASR), face two main challenges in Mandarin Chinese: 1) The two-stage models create an information gap between the phoneme or tone classification stage and the MDD stage.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 28 May 2024 • Kai Chen, Ye Wang, Yitong Li, Aiping Li, Han Yu, Xin Song
Comprehensive experiments show that TPAR outperforms SOTA methods on the link prediction task for both the interpolation and the extrapolation settings.
1 code implementation • 22 May 2024 • Wei Zeng, Xian He, Ye Wang
Piano audio-to-score transcription (A2S) is an important yet underexplored task with extensive applications for music composition, practice, and analysis.
2 code implementations • 7 May 2024 • Albert Bou, Morgan Thomas, Sebastian Dittert, Carles Navarro Ramírez, Maciej Majewski, Ye Wang, Shivam Patel, Gary Tresadern, Mazen Ahmad, Vincent Moens, Woody Sherman, Simone Sciabola, Gianni de Fabritiis
In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties.
1 code implementation • CVPR 2024 • Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks
To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.
no code implementations • 17 Apr 2024 • Haoxiang Deng, Yi Zhu, Ye Wang, Jipeng Qiang, Yunhao Yuan, Yun Li, Runmei Zhang
To address this problem, we propose a prompt-tuning method for clickbait detection via text summarization in this paper, text summarization is introduced to summarize the contents, and clickbait detection is performed based on the similarity between the generated summary and the contents.
1 code implementation • 13 Apr 2024 • Ye Wang, Yaxiong Wang, Yujiao Wu, Bingchen Zhao, Xueming Qian
To counteract this inefficiency, we opt to cluster only the unlabelled instances and subsequently expand the cluster prototypes with our introduced potential prototypes to fast explore novel classes.
no code implementations • 8 Apr 2024 • Meng Yuan, Ye Wang, Chris Manzie, Zhezhuang Xu, Tianyou Chai
To address the need for improved contouring accuracy in industrial machines with position-dependent structural flexibility, this paper introduces a novel contouring error-bounded control algorithm for biaxial switched linear systems.
no code implementations • 4 Apr 2024 • Mohammadmehdi Ataei, Hyunmin Cheong, Daniele Grandi, Ye Wang, Nigel Morris, Alexander Tessier
Second, we show how our framework effectively mimics empathic lead user interviews, identifying a greater number of latent needs than conventional human interviews.
no code implementations • 19 Mar 2024 • Xun Shen, Ye Wang, Kazumune Hashimoto, Yuhu Wu, Sebastien Gros
The existing methods of computing probabilistic reachable sets normally assume that stochastic uncertainties are independent of system states, inputs, and other environment variables.
no code implementations • 18 Mar 2024 • Xiangyu Chen, Jing Liu, Ye Wang, Pu, Wang, Matthew Brand, Guanghui Wang, Toshiaki Koike-Akino
Low-rank adaptation (LoRA) and its variants are widely employed in fine-tuning large models, including large language models for natural language processing and diffusion models for computer vision.
no code implementations • 17 Mar 2024 • Ye Wang, Zili Yi, Rui Ma
Personalized text-to-image (T2I) models not only produce lifelike and varied visuals but also allow users to tailor the images to fit their personal taste.
no code implementations • 15 Mar 2024 • Md Rubel Ahmed, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang
High-level synthesis (HLS) is a design flow that leverages modern language features and flexibility, such as complex data structures, inheritance, templates, etc., to prototype hardware designs rapidly.
no code implementations • 14 Mar 2024 • Xu Yang, Jiyuan Feng, Songyue Guo, Ye Wang, Ye Ding, Binxing Fang, Qing Liao
In this paper, we propose a novel Dynamic Affinity-based Personalized Federated Learning model (DA-PFL) to alleviate the class imbalanced problem during federated learning.
no code implementations • 14 Mar 2024 • Sipeng Zheng, Bohan Zhou, Yicheng Feng, Ye Wang, Zongqing Lu
In this paper, we propose \textbf{UniCode}, a novel approach within the domain of multimodal large language models (MLLMs) that learns a unified codebook to efficiently tokenize visual, text, and potentially other types of signals.
no code implementations • 26 Feb 2024 • Ye Wang, Xinrun Xu, Rui Xie, Wenxin Hu, Wei Ye
When completing knowledge-intensive tasks, humans sometimes need not just an answer but also a corresponding reference passage for auxiliary reading.
no code implementations • 22 Feb 2024 • Renyi Mao, Qingshan Xu, Peng Zheng, Ye Wang, Tieru Wu, Rui Ma
In this paper, we aim for both fast and high-quality implicit field learning, and propose TaylorGrid, a novel implicit field representation which can be efficiently computed via direct Taylor expansion optimization on 2D or 3D grids.
no code implementations • 14 Feb 2024 • Andrew Lowy, Zhuohang Li, Jing Liu, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang
In practical applications, such a worst-case guarantee may be overkill: practical attackers may lack exact knowledge of (nearly all of) the private data, and our data set might be easier to defend, in some sense, than the worst-case data set.
1 code implementation • 13 Feb 2024 • Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, Jing Jiang, Min Lin
A multimodal large language model (MLLM) agent can receive instructions, capture images, retrieve histories from memory, and decide which tools to use.
1 code implementation • 8 Feb 2024 • Hengguan Huang, Songtao Wang, Hongfu Liu, Hao Wang, Ye Wang
Traditional applications of natural language processing (NLP) in healthcare have predominantly focused on patient-centered services, enhancing patient interactions and care delivery, such as through medical dialogue systems.
no code implementations • 8 Jan 2024 • Ruiqi Liu, Peng Zheng, Ye Wang, Rui Ma
Conversely, some GAN-based 2D portrait synthesis methods can achieve clear disentanglement of facial regions, but they cannot preserve view consistency due to a lack of 3D modeling abilities.
no code implementations • 29 Dec 2023 • Linlian Jiang, Pan Chen, Ye Wang, Tieru Wu, Rui Ma
Inferring missing regions from severely occluded point clouds is highly challenging.
no code implementations • 20 Dec 2023 • Yan Cai, LinLin Wang, Ye Wang, Gerard de Melo, Ya zhang, Yanfeng Wang, Liang He
The emergence of various medical large language models (LLMs) in the medical domain has highlighted the need for unified evaluation standards, as manual evaluation of LLMs proves to be time-consuming and labor-intensive.
1 code implementation • 25 Oct 2023 • Jingwei Zhao, Gus Xia, Ziyu Wang, Ye Wang
In the realm of music AI, arranging rich and structured multi-track accompaniments from a simple lead sheet presents significant challenges.
1 code implementation • 24 Oct 2023 • Panfeng Cao, Ye Wang, Qiang Zhang, Zaiqiao Meng
Key information extraction (KIE) from scanned documents has gained increasing attention because of its applications in various domains.
1 code implementation • 14 Oct 2023 • Hongfu Liu, Hengguan Huang, Ye Wang
In this work, we propose a novel wild acoustic TTA method tailored for ASR fine-tuned acoustic foundation models.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 13 Oct 2023 • Hongfu Liu, Ye Wang
Large Language models (LLMs) possess the capability to engage In-context Learning (ICL) by leveraging a few demonstrations pertaining to a new downstream task as conditions.
no code implementations • 12 Oct 2023 • Niklas Smedemark-Margulies, Ye Wang, Toshiaki Koike-Akino, Jing Liu, Kieran Parsons, Yunus Bicer, Deniz Erdogmus
Classification models for electroencephalogram (EEG) data show a large decrease in performance when evaluated on unseen test sub jects.
2 code implementations • 4 Oct 2023 • Xiangming Gu, Chao Du, Tianyu Pang, Chongxuan Li, Min Lin, Ye Wang
Looking into this, we first observe that memorization behaviors tend to occur on smaller-sized datasets, which motivates our definition of effective model memorization (EMM), a metric measuring the maximum size of training data at which a learned diffusion model approximates its theoretical optimum.
1 code implementation • 4 Oct 2023 • Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, Jingjing Chen, Xuanjing Huang, Zhongyu Wei
Recent years have witnessed remarkable progress in the development of large vision-language models (LVLMs).
no code implementations • 3 Oct 2023 • Vivian Ziemke, Benjamin Estermann, Roger Wattenhofer, Ye Wang
In the evolving landscape of digital art, Non-Fungible Tokens (NFTs) have emerged as a groundbreaking platform, bridging the realms of art and technology.
1 code implementation • ICCV 2023 • Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks
To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation.
no code implementations • 20 Sep 2023 • Zhonglin Cao, Simone Sciabola, Ye Wang
Accurate model can achieve high sample efficiency by finding the most promising compounds with only a fraction of the whole library being virtually screened.
no code implementations • 5 Sep 2023 • Minjie Chen, Yao Cheng, Ye Wang, Xiang Li, Ming Gao
Further, Since the triplet loss only optimizes the relative distance between the anchor and its positive/negative samples, it is difficult to ensure the absolute distance between the anchor and positive sample.
no code implementations • 21 Aug 2023 • Ye Wang, Erik Weyer, Chris Manzie, Angus R. Simpson, Lisa Blinco
To address these limitations, we introduce a method to simultaneously design infrastructure and develop control parameters, the co-design problem, with the aim of improving the overall efficiency of the system.
1 code implementation • 5 Aug 2023 • Xiangming Gu, Wei Zeng, Ye Wang
Leveraging the prior knowledge that pitch distributions may contribute to the gender bias, we propose conditionally aligning acoustic representations between demographic groups by feeding note events to the attribute predictor.
1 code implementation • journal 2023 • Bo Xie, Xiaohui Jia, Xiawen Song, Hua Zhang, Bi Chen, Bo Jiang, Ye Wang, Yun Pan
It usually includes slot filling and intent detection (SFID) tasks aiming at semantic parsing of utterances.
no code implementations • 5 Jul 2023 • Longshen Ou, Xichu Ma, Ye Wang
Despite previous efforts in melody-to-lyric generation research, there is still a significant compatibility gap between generated lyrics and melodies, negatively impacting the singability of the outputs.
1 code implementation • 26 Jun 2023 • Ye Wang, Huazheng Pan, Tao Zhang, Wen Wu, Wenxin Hu
Motivated by this, we propose a positive-augmentation and positive-mixup positive-unlabeled metric learning framework (P3M).
Document-level RE with incomplete labeling
Metric Learning
+1
1 code implementation • 19 Jun 2023 • Ye Wang, Yaxiong Wang, Guoshuai Zhao, Xueming Qian
Concretely, RESA mimics the real incremental setting and constructs pseudo incremental tasks globally and locally, where the global pseudo incremental tasks are designed to coincide with the learning objective of FSCIL and the local pseudo incremental tasks are designed to improve the model's plasticity, respectively.
class-incremental learning
Few-Shot Class-Incremental Learning
+2
1 code implementation • 16 Jun 2023 • Han Wang, Yi Zhu, Ye Wang, Yun Li, Yunhao Yuan, Jipeng Qiang
Clickbait, which aims to induce users with some surprising and even thrilling headlines for increasing click-through rates, permeates almost all online content publishers, such as news portals and social media.
no code implementations • 31 May 2023 • Hongfu Liu, Mingqian Shi, Ye Wang
Automatic Pronunciation Assessment (APA) is vital for computer-assisted language learning.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
1 code implementation • 26 May 2023 • Longshen Ou, Xichu Ma, Min-Yen Kan, Ye Wang
The development of general-domain neural machine translation (NMT) methods has advanced significantly in recent years, but the lack of naturalness and musical constraints in the outputs makes them unable to produce singable lyric translations.
no code implementations • 5 May 2023 • Xiaochuan Zhang, Mengran Li, Ye Wang, Haojun Fei
To address these challenges, we propose Attribute missing Graph Contrastive Learning (AmGCL), a framework for handling missing node attributes in attribute graph data.
no code implementations • 20 Mar 2023 • Ye Wang, Bowei Jiang, Changqing Zou, Rui Ma
Existing cross-modal contrastive representation learning (XM-CLR) methods such as CLIP are not fully suitable for multifold data as they only consider one positive pair and treat other pairs as negative when computing the contrastive loss.
2 code implementations • ICCV 2023 • Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao
However, despite researchers exploring cross-lingual translation techniques such as machine translation and audio speech translation to overcome language barriers, there is still a shortage of cross-lingual studies on visual speech.
no code implementations • 5 Mar 2023 • Xiaodan Xi, Ge Li, Ye Wang, Yeonsoo Jeon, Michael Orshansky
We construct lattice PUF with a physically obfuscated key and an LWE decryption function block.
no code implementations • ICCV 2023 • Wang Lin, Tao Jin, Ye Wang, Wenwen Pan, Linjun Li, Xize Cheng, Zhou Zhao
In this study, we propose a new task, group video captioning, which aims to infer the desired content among a group of target videos and describe it with another group of related reference videos.
1 code implementation • 28 Dec 2022 • Ye Wang, Rui Ma, Xiaoqing Ma, Honghua Cui, Yubin Xiao, Xuan Wu, You Zhou
BMEC contains 5, 666 images of individual erythroid cells, each of which is extracted from the bone marrow erythroid cell smears and professionally annotated to one of the four types of erythroid cells.
no code implementations • 7 Dec 2022 • Yinpeng Dong, Peng Chen, Senyou Deng, Lianji L, Yi Sun, Hanyu Zhao, Jiaxing Li, Yunteng Tan, Xinyu Liu, Yangyi Dong, Enhui Xu, Jincai Xu, Shu Xu, Xuelin Fu, Changfeng Sun, Haoliang Han, Xuchong Zhang, Shen Chen, Zhimin Sun, Junyi Cao, Taiping Yao, Shouhong Ding, Yu Wu, Jian Lin, Tianpeng Wu, Ye Wang, Yu Fu, Lin Feng, Kangkang Gao, Zeyu Liu, Yuanzhe Pang, Chengqi Duan, Huipeng Zhou, Yajie Wang, Yuhang Zhao, Shangbo Wu, Haoran Lyu, Zhiyu Lin, YiFei Gao, Shuang Li, Haonan Wang, Jitao Sang, Chen Ma, Junhao Zheng, Yijia Li, Chao Shen, Chenhao Lin, Zhichao Cui, Guoshuai Liu, Huafeng Shi, Kun Hu, Mengxin Zhang
The security of artificial intelligence (AI) is an important research area towards safe, reliable, and trustworthy AI systems.
no code implementations • 17 Oct 2022 • Wenlu Wang, Ye Wang, Honggang Zhao, Simone Sciabola
In the scope of drug discovery, the molecular design aims to identify novel compounds from the chemical space where the potential drug-like molecules are estimated to be in the order of 10^60 - 10^100.
1 code implementation • 17 Oct 2022 • Ye Wang, Xinxin Liu, Wenxin Hu, Tao Zhang
To solve the common incomplete labeling problem, we propose a unified positive-unlabeled learning framework - shift and squared ranking loss positive-unlabeled (SSR-PU) learning.
no code implementations • 6 Oct 2022 • Meng Yuan, Ye Wang, Lei LI, Tianyou Chai, Wei Tech Ang
Electric-powered wheelchair plays an important role in providing accessibility for people with mobility impairment.
no code implementations • 29 Sep 2022 • Toshiaki Koike-Akino, Ye Wang
In this paper, we introduce an emerging quantum machine learning (QML) framework to assist classical deep learning methods for biosignal processing applications.
no code implementations • 20 Sep 2022 • Haifeng Xia, Pu Perry Wang, Toshiaki Koike-Akino, Ye Wang, Philip Orlik, Zhengming Ding
Domain adaptation (DA) aims to transfer the knowledge of a well-labeled source domain to facilitate unlabeled target learning.
no code implementations • 26 Aug 2022 • Yujia Yang, Ye Wang, Chris Manzie, Ye Pu
The cyclic-small-gain theorem is used to derive sufficient conditions on the quantization parameters for guaranteeing the stability of the system under a limited data rate.
no code implementations • 26 Aug 2022 • Ye Wang, Qi Zhao, Wenyan Wu, Ailsa Willis, Angus R. Simpson, Erik Weyer
This paper presents a case study of the operational management of the Robinvale high-pressure piped irrigation water delivery system (RVHPS) in Australia.
no code implementations • 25 Aug 2022 • Ye Wang, Yujia Yang, Ye Pu, Chris Manzie
Constraint handling during tracking operations is at the core of many real-world control implementations and is well understood when dynamic models of the underlying system exist, yet becomes more challenging when data-driven models are used to describe the nonlinear system at hand.
no code implementations • 26 Jul 2022 • Ye Wang, Jingbo Liao, Hong Yu, Guoyin Wang, Xiaoxia Zhang, Li Liu
Particularly, the model integrates the macro-level guided-category knowledge and micro-level open-domain dialogue data for the training, leveraging the priori knowledge into the latent space, which enables the model to disentangle the latent variables within the mesoscopic scale.
1 code implementation • 20 Jul 2022 • Longshen Ou, Xiangming Gu, Ye Wang
To fill in the performance gap between ALT and ASR, we attempt to exploit the similarities between speech and singing.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
1 code implementation • 13 Jul 2022 • Xiangming Gu, Longshen Ou, Danielle Ong, Ye Wang
Automatic lyric transcription (ALT) is a nascent field of study attracting increasing interest from both the speech and music information retrieval communities, given its significant application potential.
no code implementations • Findings (EMNLP) 2021 • Yanmeng Wang, Jun Bai, Ye Wang, Jianfei Zhang, Wenge Rong, Zongcheng Ji, Shaojun Wang, Jing Xiao
To keep independent encoding of questions and answers during inference stage, variational auto-encoder is further introduced to reconstruct answers (questions) from question (answer) embeddings as an auxiliary task to enhance QA interaction in representation learning in training stage.
no code implementations • 17 May 2022 • Toshiaki Koike-Akino, Pu Wang, Ye Wang
Commercial Wi-Fi devices can be used for integrated sensing and communications (ISAC) to jointly exchange data and monitor indoor environment.
no code implementations • 17 May 2022 • Toshiaki Koike-Akino, Pu Wang, Ye Wang
Beyond data communications, commercial-off-the-shelf Wi-Fi devices can be used to monitor human activities, track device locomotion, and sense the ambient environment.
no code implementations • 17 May 2022 • Bryan Liu, Toshiaki Koike-Akino, Ye Wang, Kieran Parsons
This paper introduces a new quantum computing framework integrated with a two-step compressed sensing technique, applied to a joint channel estimation and user identification problem.
no code implementations • 17 May 2022 • Bryan Liu, Toshiaki Koike-Akino, Ye Wang, Kieran Parsons
This paper investigates a turbo receiver employing a variational quantum circuit (VQC).
5 code implementations • 27 Apr 2022 • Jiahong Zhang, Meijun Qu, Ye Wang, Lihong Cao
Unlike previous attention mechanisms that handle pixel-level, channel-level, or patch-level features, MPA focuses on features at the image level.
no code implementations • 8 Apr 2022 • Longshen Ou, Ziyi Guo, Emmanouil Benetos, Jiqing Han, Ye Wang
Most recent research about automatic music transcription (AMT) uses convolutional neural networks and recurrent neural networks to model the mapping from music signals to symbolic notation.
no code implementations • ACL 2022 • Kai Chen, Ye Wang, Yitong Li, Aiping Li
Temporal factors are tied to the growth of facts in realistic applications, such as the progress of diseases and the development of political situation, therefore, research on Temporal Knowledge Graph (TKG) attracks much attention.
Ranked #3 on
Link Prediction
on GDELT
no code implementations • 11 Feb 2022 • Agostino Capponi, Ruizhe Jia, Ye Wang
A 1% increase in the probability of being frontrun raises users' adoption rate of the dark venue by 0. 6%.
no code implementations • 28 Dec 2021 • Jianyuan Yu, Pu, Wang, Toshiaki Koike-Akino, Ye Wang, Philip V. Orlik, R. Michael Buehrer
The granularity matching is realized by pairing two feature maps from the CSI and beam SNR at different granularity levels and linearly combining all paired feature maps into a fused feature map with learnable weights.
no code implementations • 17 Dec 2021 • Niklas Smedemark-Margulies, Ye Wang, Toshiaki Koike-Akino, Deniz Erdogmus
We provide a regularization framework for subject transfer learning in which we seek to train an encoder and classifier to minimize classification loss, subject to a penalty measuring independence between the latent representation and the subject label.
no code implementations • 1 Nov 2021 • Safa C. Medin, Bernhard Egger, Anoop Cherian, Ye Wang, Joshua B. Tenenbaum, Xiaoming Liu, Tim K. Marks
Recent advances in generative adversarial networks (GANs) have led to remarkable achievements in face image synthesis.
no code implementations • 19 Oct 2021 • Bo Pang, Yongquan Fu, Siyuan Ren, Ye Wang, Qing Liao, Yan Jia
Extensive evaluation over real-world traffic data sets, including normal, encrypted and malicious labels, show that, CGNN improves the prediction accuracy by 23\% to 29\% for application classification, by 2\% to 37\% for malicious traffic classification, and reaches the same accuracy level for encrypted traffic classification.
1 code implementation • CVPR 2022 • Aditya Sanghi, Hang Chu, Joseph G. Lambourne, Ye Wang, Chin-Yi Cheng, Marco Fumero, Kamal Rahimi Malekshan
Generating shapes using natural language can enable new ways of imagining and creating the things around us.
no code implementations • 29 Sep 2021 • Xueyang Wu, Hengguan Huang, Hao Wang, Ye Wang, Qian Xu
However, it is challenging for GANs to model distributions of separate non-i. i. d.
no code implementations • 28 Sep 2021 • Qianmengke Zhao, Ye Wang, Qun Liu
Although deep learning models are powerful among various applications, most deep learning models are still a black box, lacking verifiability and interpretability, which means the decision-making process that human beings cannot understand.
no code implementations • SEMEVAL 2021 • Ye Wang, Yanmeng Wang, Haijun Zhu, Bo Zeng, Zhenghong Hao, Shaojun Wang, Jing Xiao
This paper describes the winning system for subtask 2 and the second-placed system for subtask 1 in SemEval 2021 Task 4: ReadingComprehension of Abstract Meaning.
1 code implementation • 17 Jul 2021 • Hengguan Huang, Hongfu Liu, Hao Wang, Chang Xiao, Ye Wang
In this paper, we present a probabilistic ordinary differential equation (ODE), called STochastic boundaRy ODE (STRODE), that learns both the timings and the dynamics of time series data without requiring any timing annotations during training.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
no code implementations • 16 Jun 2021 • Andac Demir, Toshiaki Koike-Akino, Ye Wang, Masaki Haruna, Deniz Erdogmus
Convolutional neural networks (CNN) have been frequently used to extract subject-invariant features from electroencephalogram (EEG) for classification tasks.
no code implementations • 28 May 2021 • Lioba Heimbach, Ye Wang, Roger Wattenhofer
In this paper, we aim to understand how liquidity providers react to market information and how they benefit from providing liquidity in DEXes.
no code implementations • 10 May 2021 • Shuqi Dai, Xichu Ma, Ye Wang, Roger B. Dannenberg
Many practices have been presented in music generation recently.
no code implementations • 21 Apr 2021 • Ye Wang, Yan Chen, Haotian Wu, Liyi Zhou, Shuiguang Deng, Roger Wattenhofer
We find that traders have executed 292, 606 cyclic arbitrages over eleven months and exploited more than 138 million USD in revenue.
no code implementations • 13 Apr 2021 • Kaitai Zhang, Bin Wang, Hong-Shuo Chen, Ye Wang, Shiyu Mou, C. -C. Jay Kuo
The main challenge of dynamic texture synthesis lies in how to maintain spatial and temporal consistency in synthesized videos.
no code implementations • 28 Feb 2021 • Jialin Peng, Ye Wang
Despite the remarkable performance of deep learning methods on various tasks, most cutting-edge models rely heavily on large-scale annotated training examples, which are often unavailable for clinical and health care tasks.
1 code implementation • 23 Nov 2020 • Saeid Asgari Taghanaki, Jieliang Luo, Ran Zhang, Ye Wang, Pradeep Kumar Jayaraman, Krishna Murthy Jatavallabhula
We also find that robustness to unseen transformations cannot be brought about merely by extensive data augmentation.
no code implementations • 15 Oct 2020 • Ye Wang, Kevin Too Yok, Wenyan Wu, Angus R. Simpson, Erik Weyer, Chris Manzie
In this research, a novel economic model predictive control (EMPC) framework for real-time management of WDSs is proposed.
no code implementations • 28 Sep 2020 • Mo Han, Ozan Ozdenizci, Toshiaki Koike-Akino, Ye Wang, Deniz Erdogmus
Human computer interaction (HCI) involves a multidisciplinary fusion of technologies, through which the control of external devices could be achieved by monitoring physiological status of users.
no code implementations • 26 Aug 2020 • Mo Han, Ozan Ozdenizci, Ye Wang, Toshiaki Koike-Akino, Deniz Erdogmus
Recent developments in biosignal processing have enabled users to exploit their physiological status for manipulating devices in a reliable and safe manner.
no code implementations • 17 Aug 2020 • Qiang Liu, Tao Han, Ning Zhang, Ye Wang
Network slicing enables multiple virtual networks run on the same physical infrastructure to support various use cases in 5G and beyond.
no code implementations • 22 Jul 2020 • Ye Wang, Shuchin Aeron, Adnan Siraj Rakin, Toshiaki Koike-Akino, Pierre Moulin
Robust machine learning formulations have emerged to address the prevalent vulnerability of deep neural networks to adversarial examples.
no code implementations • 17 Jul 2020 • Wenjie Chen, Fengtong Du, Ye Wang, Lihong Cao
Furthermore, we define a new continual learning paradigm to simulate the possible continual learning process in the human brain.
no code implementations • ICML 2020 • Hengguan Huang, Fuzhao Xue, Hao Wang, Ye Wang
Lying at the core of human intelligence, relational thinking is characterized by initially relying on innumerable unconscious percepts pertaining to relations between new sensory signals and prior knowledge, consequently becoming a recognizable concept or object through coupling and transformation of these percepts.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 2 Jul 2020 • Andac Demir, Toshiaki Koike-Akino, Ye Wang, Deniz Erdogmus
Learning data representations that capture task-related features, but are invariant to nuisance variations remains a key challenge in machine learning.
no code implementations • 6 May 2020 • Toshiaki Koike-Akino, Ye Wang
This is motivated by the rateless property of conventional PCA, where the least important principal components can be discarded to realize variable rate dimensionality reduction that gracefully degrades the distortion.
no code implementations • 15 Apr 2020 • Mo Han, Ozan Ozdenizci, Ye Wang, Toshiaki Koike-Akino, Deniz Erdogmus
Recent developments in wearable sensors demonstrate promising results for monitoring physiological status in effective and comfortable ways.
1 code implementation • CVPR 2020 • Abhinav Kumar, Tim K. Marks, Wenxuan Mou, Ye Wang, Michael Jones, Anoop Cherian, Toshiaki Koike-Akino, Xiaoming Liu, Chen Feng
In this paper, we present a novel framework for jointly predicting landmark locations, associated uncertainties of these predicted locations, and landmark visibilities.
Ranked #1 on
Face Alignment
on Menpo
no code implementations • 15 Feb 2020 • Sairamvinay Vijayaraghavan, Ye Wang, Zhiyuan Guo, John Voong, Wenda Xu, Armand Nasseri, Jiaru Cai, Linda Li, Kevin Vuong, Eshan Wadhwa
This is a paper for exploring various different models aiming at developing fake news detection models and we had used certain machine learning algorithms and we had used pretrained algorithms such as TFIDF and CV and W2V as features for processing textual data.
no code implementations • 22 Nov 2019 • Toshiaki Koike-Akino, Ye Wang, David S. Millar, Keisuke Kojima, Kieran Parsons
Recently, data-driven approaches motivated by modern deep learning have been applied to optical communications in place of traditional model-based counterparts.
no code implementations • 27 Mar 2019 • Ozan Ozdenizci, Ye Wang, Toshiaki Koike-Akino, Deniz Erdogmus
Deep learning methods for person identification based on electroencephalographic (EEG) brain activity encounters the problem of exploiting the temporally correlated structures or recording session specific variability within EEG.
no code implementations • 9 Mar 2019 • Ye Wang, Toshiaki Koike-Akino
The deep learning trend has recently impacted a variety of fields, including communication systems, where various approaches have explored the application of neural networks in place of traditional designs.
no code implementations • 9 Mar 2019 • Toshiki Matsumine, Toshiaki Koike-Akino, Ye Wang
This paper studies a new application of deep learning (DL) for optimizing constellations in two-way relaying with physical-layer network coding (PNC), where deep neural network (DNN)-based modulation and demodulation are employed at each terminal and relay node.
no code implementations • 19 Dec 2018 • Ye Wang, Jongmoo Choi, Yueru Chen, Siyang Li, Qin Huang, Kaitai Zhang, Ming-Sui Lee, C. -C. Jay Kuo
Unsupervised video object segmentation is a crucial application in video analysis without knowing any prior information about the objects.
no code implementations • 19 Dec 2018 • Ye Wang, Yueru Chen, Jongmoo Choi, C. -C. Jay Kuo
One is a model-based drone augmentation technique that automatically generates visible drone images with a bounding box label on the drone's location.
no code implementations • 17 Dec 2018 • Ozan Ozdenizci, Ye Wang, Toshiaki Koike-Akino, Deniz Erdogmus
We introduce adversarial neural networks for representation learning as a novel approach to transfer learning in brain-computer interfaces (BCIs).
no code implementations • 13 Dec 2018 • Ye Wang, Jongmoo Choi, Yueru Chen, Qin Huang, Siyang Li, Ming-Sui Lee, C. -C. Jay Kuo
Experimental results on DAVIS and FBMS show that the proposed method outperforms state-of-the-art unsupervised segmentation methods on various benchmark datasets.
no code implementations • 11 Jun 2018 • Ye Wang, Mi Lu
Currently, various types of CAPTCHAs need corresponding segmentation to identify single character due to the numerous different segmentation ways.
no code implementations • 21 May 2018 • Ye Wang, Toshiaki Koike-Akino, Deniz Erdogmus
In this method, an adversarial network attempts to recover the nuisance variable from the representation, which the VAE is trained to prevent.
no code implementations • 11 Apr 2018 • Han Wang, Ye Wang, Xinxiang Zhang, Mi Lu, Yoonsuck Choe, Jingjing Cao
Unlike previous unknown nouns tagging task, this is the first attempt to focus on out-of-vocabulary (OOV) lexical evaluation tasks that do not require any prior knowledge.
no code implementations • 19 Dec 2017 • Ardhendu Tripathy, Ye Wang, Prakash Ishwar
We propose a data-driven framework for optimizing privacy-preserving data release mechanisms to attain the information-theoretically optimal tradeoff between minimizing distortion of useful data and concealing specific sensitive information.
no code implementations • 20 Jul 2017 • Qin Huang, Chunyang Xia, Chi-Hao Wu, Siyang Li, Ye Wang, Yuhang Song, C. -C. Jay Kuo
Recent development in fully convolutional neural network enables efficient end-to-end learning of semantic segmentation.
Ranked #107 on
Semantic Segmentation
on NYU Depth v2
no code implementations • NeurIPS 2015 • Ye Wang, David B. Dunson
Learning of low dimensional structure in multidimensional data is a canonical problem in machine learning.
no code implementations • 6 Nov 2013 • Xinxi Wang, Yi Wang, David Hsu, Ye Wang
Current music recommender systems typically act in a greedy fashion by recommending songs with the highest user ratings.