Search Results for author: Yue Jiang

Found 39 papers, 12 papers with code

Using Mixed Incentives to Document Xi’an Guanzhong

no code implementations NIDCP (LREC) 2022 Juhong Zhan, Yue Jiang, Christopher Cieri, Mark Liberman, Jiahong Yuan, Yiya Chen, Odette Scharenborg

This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.

Controllable GUI Exploration

no code implementations5 Feb 2025 Aryan Garg, Yue Jiang, Antti Oulasvirta

During the early stages of interface design, designers need to produce multiple sketches to explore a design space.

BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation

1 code implementation15 Jan 2025 Xiaolu Hou, Mingcheng Li, Dingkang Yang, Jiawei Chen, Ziyun Qian, Xiao Zhao, Yue Jiang, Jinjie Wei, Qingyao Xu, Lihua Zhang

To this end, we propose BloomScene, a lightweight structured 3D Gaussian splatting for crossmodal scene generation, which creates diverse and high-quality 3D scenes from text or image inputs.

Point cloud reconstruction Scene Generation

Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning

no code implementations5 Nov 2024 Mingcheng Li, Dingkang Yang, Yang Liu, Shunli Wang, Jiawei Chen, Shuaibing Wang, Jinjie Wei, Yue Jiang, Qingyao Xu, Xiaolu Hou, Mingyang Sun, Ziyun Qian, Dongliang Kou, Lihua Zhang

Specifically, we propose a fine-grained representation factorization module that sufficiently extracts valuable sentiment information by factorizing modality into sentiment-relevant and modality-specific representations through crossmodal translation and sentiment semantic reconstruction.

Multimodal Sentiment Analysis Representation Learning

nextlocllm: next location prediction using LLMs

no code implementations11 Oct 2024 Shuai Liu, Ning Cao, Yile Chen, Yue Jiang, Gao Cong

Experiments show that NextLocLLM outperforms existing models in next location prediction, excelling in both supervised and zero-shot settings.

Prediction

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

1 code implementation30 Sep 2024 Yi-Hao Peng, Faria Huq, Yue Jiang, Jason Wu, Amanda Xin Yue Li, Jeffrey Bigham, Amy Pavel

Enabling machines to understand structured visuals like slides and user interfaces is essential for making them accessible to people with disabilities.

Code Generation Synthetic Data Generation

Dark Miner: Defend against undesired generation for text-to-image diffusion models

no code implementations26 Sep 2024 Zheling Meng, Bo Peng, Xiaochuan Jin, Yue Jiang, Jing Dong, Wei Wang

Most existing methods focus on modifying the generation probabilities conditioned on the texts containing target concepts.

Self-supervised Learning for Geospatial AI: A Survey

no code implementations22 Aug 2024 Yile Chen, Weiming Huang, Kaiqi Zhao, Yue Jiang, Gao Cong

The proliferation of geospatial data in urban and territorial environments has significantly facilitated the development of geospatial artificial intelligence (GeoAI) across various urban applications.

Self-Supervised Learning Survey

A New Dataset, Notation Software, and Representation for Computational Schenkerian Analysis

no code implementations13 Aug 2024 Stephen Ni-Hahn, Weihan Xu, Jerry Yin, Rico Zhu, Simon Mak, Yue Jiang, Cynthia Rudin

Schenkerian Analysis (SchA) is a uniquely expressive method of music analysis, combining elements of melody, harmony, counterpoint, and form to describe the hierarchical structure supporting a work of music.

SAGDFN: A Scalable Adaptive Graph Diffusion Forecasting Network for Multivariate Time Series Forecasting

1 code implementation18 Jun 2024 Yue Jiang, Xiucheng Li, Yile Chen, Shuai Liu, Weilong Kong, Antonis F. Lentzakis, Gao Cong

Time series forecasting is essential for our daily activities and precise modeling of the complex correlations and shared patterns among multiple time series is essential for improving forecasting performance.

Multivariate Time Series Forecasting Time Series

CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation

no code implementations17 Jun 2024 Yue Jiang, Jiawei Chen, Dingkang Yang, Mingcheng Li, Shunli Wang, Tong Wu, Ke Li, Lihua Zhang

Automatic medical report generation (MRG), which possesses significant research value as it can aid radiologists in clinical diagnosis and report composition, has garnered increasing attention.

Diagnostic Hallucination +1

Enhancing Criminal Case Matching through Diverse Legal Factors

no code implementations17 Jun 2024 Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang

In this paper, we propose a two-stage framework named Diverse Legal Factor-enhanced Criminal Case Matching (DLF-CCM).

Multi-Task Learning

Detecting and Evaluating Medical Hallucinations in Large Vision Language Models

no code implementations14 Jun 2024 Jiawei Chen, Dingkang Yang, Tong Wu, Yue Jiang, Xiaolu Hou, Mingcheng Li, Shunli Wang, Dongling Xiao, Ke Li, Lihua Zhang

To bridge this gap, we introduce Med-HallMark, the first benchmark specifically designed for hallucination detection and evaluation within the medical multimodal domain.

Hallucination Medical Visual Question Answering +2

SC2: Towards Enhancing Content Preservation and Style Consistency in Long Text Style Transfer

no code implementations7 Jun 2024 Jie Zhao, Ziyu Guan, Cai Xu, Wei Zhao, Yue Jiang

We design a style consistency loss to ensure the generated multiple sentences consistently reflect the target style polarity.

Decoder Denoising +2

PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications

1 code implementation29 May 2024 Dingkang Yang, Jinjie Wei, Dongling Xiao, Shunli Wang, Tong Wu, Gang Li, Mingcheng Li, Shuaibing Wang, Jiawei Chen, Yue Jiang, Qingyao Xu, Ke Li, Peng Zhai, Lihua Zhang

In the parameter-efficient secondary SFT phase, a mixture of universal-specific experts strategy is presented to resolve the competency conflict between medical generalist and pediatric expertise mastery.

Diagnostic Domain Adaptation

Impact of Design Decisions in Scanpath Modeling

no code implementations14 May 2024 Parvin Emami, Yue Jiang, Zixin Guo, Luis A. Leiva

We show that even small variations of these design parameters have a noticeable impact on standard evaluation metrics such as DTW or Eyenalysis.

Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models

no code implementations25 Apr 2024 Jiawei Chen, Dingkang Yang, Yue Jiang, Mingcheng Li, Jinjie Wei, Xiaolu Hou, Lihua Zhang

In the realm of Medical Visual Language Models (Med-VLMs), the quest for universal efficient fine-tuning mechanisms remains paramount, especially given researchers in interdisciplinary fields are often extremely short of training resources, yet largely unexplored.

Medical Visual Question Answering parameter-efficient fine-tuning +2

Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces

no code implementations21 Apr 2024 Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta

Present-day graphical user interfaces (GUIs) exhibit diverse arrangements of text, graphics, and interactive elements such as buttons and menus, but representations of GUIs have not kept up.

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

no code implementations15 Apr 2024 Yue Jiang, Zixin Guo, Hamed Rezazadegan Tavakoli, Luis A. Leiva, Antti Oulasvirta

From a visual perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus.

Deep Reinforcement Learning reinforcement-learning

Can LLMs' Tuning Methods Work in Medical Multimodal Domain?

2 code implementations11 Mar 2024 Jiawei Chen, Yue Jiang, Dingkang Yang, Mingcheng Li, Jinjie Wei, Ziyun Qian, Lihua Zhang

In this paper, we delve into the fine-tuning methods of LLMs and conduct extensive experiments to investigate the impact of fine-tuning methods for large models on the existing multimodal model in the medical domain from the training data level and the model structure level.

Transfer Learning World Knowledge

MISS: A Generative Pretraining and Finetuning Approach for Med-VQA

1 code implementation10 Jan 2024 Jiawei Chen, Dingkang Yang, Yue Jiang, Yuxuan Lei, Lihua Zhang

Medical visual question answering (VQA) is a challenging multimodal task, where Vision-Language Pre-training (VLP) models can effectively improve the generalization performance.

Medical Visual Question Answering Multi-Task Learning +3

RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models

no code implementations8 Dec 2023 Yue Jiang, Yueming Lyu, Tianxiang Ma, Bo Peng, Jing Dong

Extensive empirical evaluations demonstrate that the introduced \themodel effectively corrects the racial stereotypes of the well-trained Stable Diffusion model while leaving the original model unchanged.

Image Generation

Large Language Model based Long-tail Query Rewriting in Taobao Search

1 code implementation7 Nov 2023 Wenjun Peng, Guiyang Li, Yue Jiang, Zilong Wang, Dan Ou, Xiaoyi Zeng, Derong Xu, Tong Xu, Enhong Chen

In the realm of e-commerce search, the significance of semantic matching cannot be overstated, as it directly impacts both user experience and company revenue.

Contrastive Learning Language Modeling +3

DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing

1 code implementation12 Oct 2023 Yueming Lyu, Kang Zhao, Bo Peng, Yue Jiang, Yingya Zhang, Jing Dong

Based on DeltaSpace, we propose a novel framework called DeltaEdit, which maps the CLIP visual feature differences to the latent space directions of a generative model during the training phase, and predicts the latent space directions from the CLIP textual feature differences during the inference phase.

text-guided-image-editing

ILuvUI: Instruction-tuned LangUage-Vision modeling of UIs from Machine Conversations

no code implementations7 Oct 2023 Yue Jiang, Eldon Schoop, Amanda Swearngin, Jeffrey Nichols

Multimodal Vision-Language Models (VLMs) enable powerful applications from their fused understanding of images and language, but many perform poorly on UI tasks due to the lack of UI training data.

Language Modeling Language Modelling +1

AXNav: Replaying Accessibility Tests from Natural Language

no code implementations3 Oct 2023 Maryam Taeb, Amanda Swearngin, Eldon Schoop, Ruijia Cheng, Yue Jiang, Jeffrey Nichols

Recently, Large Language Models (LLMs) have been used for a variety of tasks including automation of UIs, however to our knowledge no one has yet explored their use in controlling assistive technologies for the purposes of supporting accessibility testing.

Few-Shot Domain Adaptation for Charge Prediction on Unprofessional Descriptions

no code implementations29 Sep 2023 Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang, Xiaofei He

Recent works considering professional legal-linguistic style (PLLS) texts have shown promising results on the charge prediction task.

Domain Adaptation Prediction

InfoStyler: Disentanglement Information Bottleneck for Artistic Style Transfer

no code implementations30 Jul 2023 Yueming Lyu, Yue Jiang, Bo Peng, Jing Dong

InfoStyler formulates the disentanglement representation learning as an information compression problem by eliminating style statistics from the content image and removing the content structure from the style image.

Disentanglement Style Transfer

3D-Aware Adversarial Makeup Generation for Facial Privacy Protection

no code implementations26 Jun 2023 Yueming Lyu, Yue Jiang, Ziwen He, Bo Peng, Yunfan Liu, Jing Dong

The privacy and security of face data on social media are facing unprecedented challenges as it is vulnerable to unauthorized access and identification.

Face Recognition Face Verification

HiFECap: Monocular High-Fidelity and Expressive Capture of Human Performances

no code implementations11 Oct 2022 Yue Jiang, Marc Habermann, Vladislav Golyanik, Christian Theobalt

Furthermore, we show that HiFECap outperforms the state-of-the-art human performance capture approaches qualitatively and quantitatively while for the first time capturing all aspects of the human.

Vocal Bursts Intensity Prediction

ReverseORC: Reverse Engineering of Resizable User Interface Layouts with OR-Constraints

no code implementations23 Feb 2022 Yue Jiang, Wolfgang Stuerzlinger, Christof Lutteroth

Furthermore, it can be used to detect and fix problems in legacy UIs, extend UIs with enhanced layout behaviours, and support the creation of flexible UI layouts.

Diversity

BacHMMachine: An Interpretable and Scalable Model for Algorithmic Harmonization for Four-part Baroque Chorales

no code implementations15 Sep 2021 Yunyao Zhu, Stephen Hahn, Simon Mak, Yue Jiang, Cynthia Rudin

Algorithmic harmonization - the automated harmonization of a musical piece given its melodic line - is a challenging problem that has garnered much interest from both music theorists and computer scientists.

ORCSolver: An Efficient Solver for Adaptive GUI Layout with OR-Constraints

1 code implementation23 Feb 2020 Yue Jiang, Wolfgang Stuerzlinger, Matthias Zwicker, Christof Lutteroth

OR-constrained (ORC) graphical user interface layouts unify conventional constraint-based layouts with flow layouts, which enables the definition of flexible layouts that adapt to screens with different sizes, orientations, or aspect ratios with only a single layout specification.

ORC Layout: Adaptive GUI Layout with OR-Constraints

no code implementations17 Dec 2019 Yue Jiang, Ruofei Du, Christof Lutteroth, Wolfgang Stuerzlinger

We propose a novel approach for constraint-based graphical user interface (GUI) layout based on OR-constraints (ORC) in standard soft/hard linear constraint systems.

Human-Computer Interaction Graphics

SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

1 code implementation CVPR 2020 Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

We propose SDFDiff, a novel approach for image-based shape optimization using differentiable rendering of 3D shapes represented by signed distance functions (SDFs).

 Ranked #1 on Single-View 3D Reconstruction on ShapeNet (using extra training data)

3D Reconstruction Multi-View 3D Reconstruction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.