Search Results for author: Yue Jiang

Found 21 papers, 5 papers with code

Using Mixed Incentives to Document Xi’an Guanzhong

no code implementations NIDCP (LREC) 2022 Juhong Zhan, Yue Jiang, Christopher Cieri, Mark Liberman, Jiahong Yuan, Yiya Chen, Odette Scharenborg

This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.

Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces

no code implementations21 Apr 2024 Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta

Present-day graphical user interfaces (GUIs) exhibit diverse arrangements of text, graphics, and interactive elements such as buttons and menus, but representations of GUIs have not kept up.

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

no code implementations15 Apr 2024 Yue Jiang, Zixin Guo, Hamed Rezazadegan Tavakoli, Luis A. Leiva, Antti Oulasvirta

From a visual perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus.

reinforcement-learning

Can LLMs' Tuning Methods Work in Medical Multimodal Domain?

no code implementations11 Mar 2024 Jiawei Chen, Yue Jiang, Dingkang Yang, Mingcheng Li, Jinjie Wei, Ziyun Qian, Lihua Zhang

We show the different impacts of fine-tuning methods for large models on medical VLMs and develop the most efficient ways to fine-tune medical VLP models.

Transfer Learning World Knowledge

MISS: A Generative Pretraining and Finetuning Approach for Med-VQA

no code implementations10 Jan 2024 Jiawei Chen, Dingkang Yang, Yue Jiang, Yuxuan Lei, Lihua Zhang

However, most methods in the medical field treat VQA as an answer classification task which is difficult to transfer to practical application scenarios.

Medical Visual Question Answering Multi-Task Learning +3

RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models

no code implementations8 Dec 2023 Yue Jiang, Yueming Lyu, Tianxiang Ma, Bo Peng, Jing Dong

Extensive empirical evaluations demonstrate that the introduced \themodel effectively corrects the racial stereotypes of the well-trained Stable Diffusion model while leaving the original model unchanged.

Image Generation

Large Language Model based Long-tail Query Rewriting in Taobao Search

no code implementations7 Nov 2023 Wenjun Peng, Guiyang Li, Yue Jiang, Zilong Wang, Dan Ou, Xiaoyi Zeng, Derong Xu, Tong Xu, Enhong Chen

In the realm of e-commerce search, the significance of semantic matching cannot be overstated, as it directly impacts both user experience and company revenue.

Contrastive Learning Language Modelling +2

DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing

1 code implementation12 Oct 2023 Yueming Lyu, Kang Zhao, Bo Peng, Yue Jiang, Yingya Zhang, Jing Dong

Based on DeltaSpace, we propose a novel framework called DeltaEdit, which maps the CLIP visual feature differences to the latent space directions of a generative model during the training phase, and predicts the latent space directions from the CLIP textual feature differences during the inference phase.

text-guided-image-editing

ILuvUI: Instruction-tuned LangUage-Vision modeling of UIs from Machine Conversations

no code implementations7 Oct 2023 Yue Jiang, Eldon Schoop, Amanda Swearngin, Jeffrey Nichols

Multimodal Vision-Language Models (VLMs) enable powerful applications from their fused understanding of images and language, but many perform poorly on UI tasks due to the lack of UI training data.

Language Modelling Large Language Model

AXNav: Replaying Accessibility Tests from Natural Language

no code implementations3 Oct 2023 Maryam Taeb, Amanda Swearngin, Eldon Schoop, Ruijia Cheng, Yue Jiang, Jeffrey Nichols

Recently, Large Language Models (LLMs) have been used for a variety of tasks including automation of UIs, however to our knowledge no one has yet explored their use in controlling assistive technologies for the purposes of supporting accessibility testing.

Few-Shot Domain Adaptation for Charge Prediction on Unprofessional Descriptions

no code implementations29 Sep 2023 Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang, Xiaofei He

Recent works considering professional legal-linguistic style (PLLS) texts have shown promising results on the charge prediction task.

Domain Adaptation

InfoStyler: Disentanglement Information Bottleneck for Artistic Style Transfer

no code implementations30 Jul 2023 Yueming Lyu, Yue Jiang, Bo Peng, Jing Dong

InfoStyler formulates the disentanglement representation learning as an information compression problem by eliminating style statistics from the content image and removing the content structure from the style image.

Disentanglement Style Transfer

3D-Aware Adversarial Makeup Generation for Facial Privacy Protection

no code implementations26 Jun 2023 Yueming Lyu, Yue Jiang, Ziwen He, Bo Peng, Yunfan Liu, Jing Dong

The privacy and security of face data on social media are facing unprecedented challenges as it is vulnerable to unauthorized access and identification.

Face Recognition Face Verification

HiFECap: Monocular High-Fidelity and Expressive Capture of Human Performances

no code implementations11 Oct 2022 Yue Jiang, Marc Habermann, Vladislav Golyanik, Christian Theobalt

Furthermore, we show that HiFECap outperforms the state-of-the-art human performance capture approaches qualitatively and quantitatively while for the first time capturing all aspects of the human.

Vocal Bursts Intensity Prediction

ReverseORC: Reverse Engineering of Resizable User Interface Layouts with OR-Constraints

no code implementations23 Feb 2022 Yue Jiang, Wolfgang Stuerzlinger, Christof Lutteroth

Furthermore, it can be used to detect and fix problems in legacy UIs, extend UIs with enhanced layout behaviours, and support the creation of flexible UI layouts.

BacHMMachine: An Interpretable and Scalable Model for Algorithmic Harmonization for Four-part Baroque Chorales

no code implementations15 Sep 2021 Yunyao Zhu, Stephen Hahn, Simon Mak, Yue Jiang, Cynthia Rudin

Algorithmic harmonization - the automated harmonization of a musical piece given its melodic line - is a challenging problem that has garnered much interest from both music theorists and computer scientists.

ORCSolver: An Efficient Solver for Adaptive GUI Layout with OR-Constraints

1 code implementation23 Feb 2020 Yue Jiang, Wolfgang Stuerzlinger, Matthias Zwicker, Christof Lutteroth

OR-constrained (ORC) graphical user interface layouts unify conventional constraint-based layouts with flow layouts, which enables the definition of flexible layouts that adapt to screens with different sizes, orientations, or aspect ratios with only a single layout specification.

ORC Layout: Adaptive GUI Layout with OR-Constraints

no code implementations17 Dec 2019 Yue Jiang, Ruofei Du, Christof Lutteroth, Wolfgang Stuerzlinger

We propose a novel approach for constraint-based graphical user interface (GUI) layout based on OR-constraints (ORC) in standard soft/hard linear constraint systems.

Human-Computer Interaction Graphics

SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

1 code implementation CVPR 2020 Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

We propose SDFDiff, a novel approach for image-based shape optimization using differentiable rendering of 3D shapes represented by signed distance functions (SDFs).

 Ranked #1 on Single-View 3D Reconstruction on ShapeNet (using extra training data)

3D Reconstruction Multi-View 3D Reconstruction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.