Search Results for author: Bo Cheng

Found 37 papers, 15 papers with code

CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models

1 code implementation24 Mar 2025 Shuhao Zhang, Bo Cheng, Jiale Han, Yuli Chen, Zhixuan Wu, Changbao Li, Pingli Gu

To fill this gap, we propose the Comprehensive Evaluation Framework for Watermark (CEFW), a unified framework that comprehensively evaluates watermarking methods across five key dimensions: ease of detection, fidelity of text quality, minimal embedding cost, robustness to adversarial attacks, and imperceptibility to prevent imitation or forgery.

PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models

no code implementations13 Mar 2025 Runze He, Bo Cheng, Yuhang Ma, Qingxiang Jia, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Liebucha Wu, Dawei Leng, Yuhui Yin

In this paper, we propose a unified layout planning and image generation model, PlanGen, which can pre-plan spatial layout conditions before generating images.

Image Manipulation Layout-to-Image Generation

NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers

no code implementations12 Mar 2025 Yuhang Ma, Bo Cheng, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Liebucha Wu, Dawei Leng, Yuhui Yin

To enhance inference performance while maintaining generation quality, we propose progressive rectified flow transformers.

Image Generation

WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation

no code implementations11 Mar 2025 Jing Wang, Ao Ma, Ke Cao, Jun Zheng, Zhanjie Zhang, Jiasong Feng, Shanyuan Liu, Yuhang Ma, Bo Cheng, Dawei Leng, Yuhui Yin, Xiaodan Liang

Experimental results demonstrate that WISA can effectively enhance the compatibility of T2V models with real-world physical laws, achieving a considerable improvement on the VideoPhy benchmark.

Text-to-Video Generation Video Generation

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

no code implementations6 Mar 2025 Adnan Shahid, Adrian Kliks, Ahmed Al-Tahmeesschi, Ahmed Elbakary, Alexandros Nikou, Ali Maatouk, Ali Mokh, Amirreza Kazemi, Antonio De Domenico, Athanasios Karapantelakis, Bo Cheng, Bo Yang, Bohao Wang, Carlo Fischione, Chao Zhang, Chaouki Ben Issaid, Chau Yuen, Chenghui Peng, Chongwen Huang, Christina Chaccour, Christo Kurisummoottil Thomas, Dheeraj Sharma, Dimitris Kalogiros, Dusit Niyato, Eli de Poorter, Elissa Mhanna, Emilio Calvanese Strinati, Faouzi Bader, Fathi Abdeldayem, Fei Wang, Fenghao Zhu, Gianluca Fontanesi, Giovanni Geraci, Haibo Zhou, Hakimeh Purmehdi, Hamed Ahmadi, Hang Zou, Hongyang Du, Hoon Lee, Howard H. Yang, Iacopo Poli, Igor Carron, Ilias Chatzistefanidis, Inkyu Lee, Ioannis Pitsiorlas, Jaron Fontaine, Jiajun Wu, Jie Zeng, Jinan Li, Jinane Karam, Johny Gemayel, Juan Deng, Julien Frison, Kaibin Huang, Kehai Qiu, Keith Ball, Kezhi Wang, Kun Guo, Leandros Tassiulas, Lecorve Gwenole, Liexiang Yue, Lina Bariah, Louis Powell, Marcin Dryjanski, Maria Amparo Canaveras Galdon, Marios Kountouris, Maryam Hafeez, Maxime Elkael, Mehdi Bennis, Mehdi Boudjelli, Meiling Dai, Merouane Debbah, Michele Polese, Mohamad Assaad, Mohamed Benzaghta, Mohammad Al Refai, Moussab Djerrab, Mubeen Syed, Muhammad Amir, Na Yan, Najla Alkaabi, Nan Li, Nassim Sehad, Navid Nikaein, Omar Hashash, Pawel Sroka, Qianqian Yang, Qiyang Zhao, Rasoul Nikbakht Silab, Rex Ying, Roberto Morabito, Rongpeng Li, Ryad Madi, Salah Eddine El Ayoubi, Salvatore D'Oro, Samson Lasaulce, Serveh Shalmashi, Sige Liu, Sihem Cherrared, Swarna Bindu Chetty, Swastika Dutta, Syed A. R. Zaidi, Tianjiao Chen, Timothy Murphy, Tommaso Melodia, Tony Q. S. Quek, Vishnu Ram, Walid Saad, Wassim Hamidouche, Weilong Chen, Xiaoou Liu, Xiaoxue Yu, Xijun Wang, Xingyu Shang, Xinquan Wang, Xuelin Cao, Yang Su, Yanping Liang, Yansha Deng, Yifan Yang, Yingping Cui, Yu Sun, Yuxuan Chen, Yvan Pointurier, Zeinab Nehme, Zeinab Nezami, Zhaohui Yang, Zhaoyang Zhang, Zhe Liu, Zhenyu Yang, Zhu Han, Zhuang Zhou, Zihan Chen, Zirui Chen, Zitao Shuai

This white paper discusses the role of large-scale AI in the telecommunications industry, with a specific focus on the potential of generative AI to revolutionize network functions and user experiences, especially in the context of 6G systems.

Management

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

no code implementations20 Feb 2025 Ke Cao, Jing Wang, Ao Ma, Jiasong Feng, Zhanjie Zhang, Xuanhua He, Shanyuan Liu, Bo Cheng, Dawei Leng, Yuhui Yin, Jie Zhang

The Diffusion Transformer plays a pivotal role in advancing text-to-image and text-to-video generation, owing primarily to its inherent scalability.

Text-to-Video Generation Video Generation

From Ceilings to Walls: Universal Dynamic Perching of Small Aerial Robots on Surfaces with Variable Orientations

no code implementations27 Dec 2024 Bryan Habas, Aaron Brown, Donghyeon Lee, Mitchell Goldman, Bo Cheng

Additionally, we investigated the effects of joint stiffness and damping in the landing gear on perching behaviors and performance.

Deep Reinforcement Learning

HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation

1 code implementation18 Oct 2024 Bo Cheng, Yuhang Ma, Liebucha Wu, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Dawei Leng, Yuhui Yin

The task of layout-to-image generation involves synthesizing images based on the captions of objects and their spatial positions.

Disentanglement Layout Generation +1

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

1 code implementation15 Aug 2024 Jiasong Feng, Ao Ma, Jing Wang, Bo Cheng, Xiaodan Liang, Dawei Leng, Yuhui Yin

Then, TAR refines the correlation matrix between cross-frame textual conditions and latent features along the time dimension.

TAR Video Generation

Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation

no code implementations25 Jun 2024 Yingting Li, Ambuj Mehrish, Bryan Chew, Bo Cheng, Soujanya Poria

Different languages have distinct phonetic systems and vary in their prosodic features making it challenging to develop a Text-to-Speech (TTS) model that can effectively synthesise speech in multilingual settings.

Speech Synthesis text-to-speech +2

HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks

1 code implementation6 Apr 2024 Yingting Li, Rishabh Bhardwaj, Ambuj Mehrish, Bo Cheng, Soujanya Poria

In this work, we present HyperTTS, which comprises a small learnable network, "hypernetwork", that generates parameters of the Adapter blocks, allowing us to condition Adapters on speaker representations and making them dynamic.

Domain Adaptation Speech Synthesis +2

Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors

1 code implementation24 Feb 2024 Shengkun Ma, Jiale Han, Yi Liang, Bo Cheng

Continual Few-shot Relation Extraction (CFRE) is a practical problem that requires the model to continuously learn novel relations while avoiding forgetting old ones with few labeled training data.

Contrastive Learning Prompt Learning +2

Focus on Local Regions for Query-based Object Detection

no code implementations10 Oct 2023 Hongbin Xu, Yamei Xia, Shuai Zhao, Bo Cheng

We improve the self-attention by isolating connections between irrelevant objects that makes it focus on local regions but not global regions.

Computational Efficiency Object +2

Distributional Soft Actor-Critic with Three Refinements

2 code implementations9 Oct 2023 Jingliang Duan, Wenxuan Wang, Liming Xiao, Jiaxin Gao, Shengbo Eben Li, Chang Liu, Ya-Qin Zhang, Bo Cheng, Keqiang Li

To address this issue, we previously proposed the Distributional Soft Actor-Critic (DSAC or DSACv1), an off-policy RL algorithm that enhances value estimation accuracy by learning a continuous Gaussian value distribution.

Decision Making Reinforcement Learning (RL)

Generative Prompt Tuning for Relation Classification

1 code implementation22 Oct 2022 Jiale Han, Shuai Zhao, Bo Cheng, Shengkun Ma, Wei Lu

Current prompt tuning methods mostly convert the downstream tasks to masked language modeling problems by adding cloze-style phrases and mapping all labels to verbalizations with fixed length, which has proven effective for tasks with simple label spaces.

Classification Language Modeling +5

Inverted Landing in a Small Aerial Robot via Deep Reinforcement Learning for Triggering and Control of Rotational Maneuvers

no code implementations22 Sep 2022 Bryan Habas, Jack W. Langelaan, Bo Cheng

Inverted landing in a rapid and robust manner is a challenging feat for aerial robots, especially while depending entirely on onboard sensing and computation.

Deep Reinforcement Learning

Analyzing Modality Robustness in Multimodal Sentiment Analysis

1 code implementation NAACL 2022 Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria

In this work, we hope to address that by (i) Proposing simple diagnostic checks for modality robustness in a trained multimodal model.

Diagnostic Multimodal Sentiment Analysis

Exploring Entity Interactions for Few-Shot Relation Learning (Student Abstract)

no code implementations4 May 2022 Yi Liang, Shuai Zhao, Bo Cheng, Yuwei Yin, Hao Yang

Few-shot relation learning refers to infer facts for relations with a limited number of observed triples.

Metric Learning Relation

FCM: A Fine-grained Comparison Model for Multi-turn Dialogue Reasoning

no code implementations Findings (EMNLP) 2021 Xu Wang, Hainan Zhang, Shuai Zhao, Yanyan Zou, Hongshen Chen, Zhuoye Ding, Bo Cheng, Yanyan Lan

Furthermore, the consistency signals between each candidate and the speaker's own history are considered to drive a model to prefer a candidate that is logically consistent with the speaker's history logic.

Reading Comprehension

Exploring Task Difficulty for Few-Shot Relation Extraction

1 code implementation EMNLP 2021 Jiale Han, Bo Cheng, Wei Lu

Few-shot relation extraction (FSRE) focuses on recognizing novel relations by learning with merely a handful of annotated instances.

Contrastive Learning Meta-Learning +2

DPN-SENet:A self-attention mechanism neural network for detection and diagnosis of COVID-19 from chest x-ray images

1 code implementation20 May 2021 Bo Cheng, Ruhui Xue, Hang Yang, Laili Zhu, Wei Xiang

We propose a deep learning model that can help radiologists and clinicians use chest X-rays to diagnose COVID-19 cases and show the diagnostic features of pneumonia.

Data Augmentation Diagnostic

Integrating Subgraph-aware Relation and DirectionReasoning for Question Answering

no code implementations1 Apr 2021 Xu Wang, Shuai Zhao, Bo Cheng, Jiale Han, Yingting Li, Hao Yang, Ivan Sekulic, Guoshun Nan

Question Answering (QA) models over Knowledge Bases (KBs) are capable of providing more precise answers by utilizing relation information among entities.

Question Answering Relation

Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence

2 code implementations18 Mar 2021 Yang Guan, Yangang Ren, Qi Sun, Shengbo Eben Li, Haitong Ma, Jingliang Duan, Yifan Dai, Bo Cheng

In this paper, we present an interpretable and computationally efficient framework called integrated decision and control (IDC) for automated vehicles, which decomposes the driving task into static path planning and dynamic optimal tracking that are structured hierarchically.

Autonomous Driving Model-based Reinforcement Learning +2

Recurrent Model Predictive Control

no code implementations23 Feb 2021 Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Qi Sun, Bo Cheng

This paper proposes an off-line algorithm, called Recurrent Model Predictive Control (RMPC), to solve general nonlinear finite-horizon optimal control problems.

model Model Predictive Control

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

2 code implementations23 Feb 2021 Yang Guan, Jingliang Duan, Shengbo Eben Li, Jie Li, Jianyu Chen, Bo Cheng

Formally, MPG is constructed as a weighted average of the data-driven and model-driven PGs, where the former is the derivative of the learned Q-value function, and the latter is that of the model-predictive return.

Decision Making Reinforcement Learning (RL) +1

Recurrent Model Predictive Control: Learning an Explicit Recurrent Controller for Nonlinear Systems

no code implementations20 Feb 2021 Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Bo Cheng

This paper proposes an offline control algorithm, called Recurrent Model Predictive Control (RMPC), to solve large-scale nonlinear finite-horizon optimal control problems.

Model Predictive Control

Mixed Reinforcement Learning with Additive Stochastic Uncertainty

no code implementations28 Feb 2020 Yao Mu, Shengbo Eben Li, Chang Liu, Qi Sun, Bingbing Nie, Bo Cheng, Baiyu Peng

This paper presents a mixed reinforcement learning (mixed RL) algorithm by simultaneously using dual representations of environmental dynamics to search the optimal policy with the purpose of improving both learning accuracy and training speed.

reinforcement-learning Reinforcement Learning +1

Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors

3 code implementations9 Jan 2020 Jingliang Duan, Yang Guan, Shengbo Eben Li, Yangang Ren, Bo Cheng

In reinforcement learning (RL), function approximation errors are known to easily lead to the Q-value overestimations, thus greatly reducing policy performance.

continuous-control Continuous Control +4

Direct and indirect reinforcement learning

no code implementations23 Dec 2019 Yang Guan, Shengbo Eben Li, Jingliang Duan, Jie Li, Yangang Ren, Qi Sun, Bo Cheng

Reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision making and control tasks.

Decision Making reinforcement-learning +3

Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints

no code implementations26 Nov 2019 Jingliang Duan, Zhengyu Liu, Shengbo Eben Li, Qi Sun, Zhenzhong Jia, Bo Cheng

CADP linearizes the constrained optimization problem locally into a quadratically constrained linear programming problem, and then obtains the optimal update of the policy network by solving its dual problem.

Intention-aware Long Horizon Trajectory Prediction of Surrounding Vehicles using Dual LSTM Networks

no code implementations6 Jun 2019 Long Xin, Pin Wang, Ching-Yao Chan, Jianyu Chen, Shengbo Eben Li, Bo Cheng

As autonomous vehicles (AVs) need to interact with other road users, it is of importance to comprehensively understand the dynamic traffic environment, especially the future possible trajectories of surrounding vehicles.

Autonomous Vehicles feature selection +3

An End-to-End Multi-task Learning Model for Fact Checking

no code implementations WS 2018 Sizhen Li, Shuai Zhao, Bo Cheng, Hao Yang

With huge amount of information generated every day on the web, fact checking is an important and challenging task which can help people identify the authenticity of most claims as well as providing evidences selected from knowledge source like Wikipedia.

Common Sense Reasoning Entity Linking +4

Cannot find the paper you are looking for? You can Submit a new open access paper.