no code implementations • NIDCP (LREC) 2022 • Juhong Zhan, Yue Jiang, Christopher Cieri, Mark Liberman, Jiahong Yuan, Yiya Chen, Odette Scharenborg
This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.
no code implementations • 5 Feb 2025 • Aryan Garg, Yue Jiang, Antti Oulasvirta
During the early stages of interface design, designers need to produce multiple sketches to explore a design space.
1 code implementation • 15 Jan 2025 • Xiaolu Hou, Mingcheng Li, Dingkang Yang, Jiawei Chen, Ziyun Qian, Xiao Zhao, Yue Jiang, Jinjie Wei, Qingyao Xu, Lihua Zhang
To this end, we propose BloomScene, a lightweight structured 3D Gaussian splatting for crossmodal scene generation, which creates diverse and high-quality 3D scenes from text or image inputs.
no code implementations • 5 Nov 2024 • Mingcheng Li, Dingkang Yang, Yang Liu, Shunli Wang, Jiawei Chen, Shuaibing Wang, Jinjie Wei, Yue Jiang, Qingyao Xu, Xiaolu Hou, Mingyang Sun, Ziyun Qian, Dongliang Kou, Lihua Zhang
Specifically, we propose a fine-grained representation factorization module that sufficiently extracts valuable sentiment information by factorizing modality into sentiment-relevant and modality-specific representations through crossmodal translation and sentiment semantic reconstruction.
no code implementations • 16 Oct 2024 • Jinjie Wei, Dingkang Yang, Yanshu Li, Qingyao Xu, Zhaoyu Chen, Mingcheng Li, Yue Jiang, Xiaolu Hou, Lihua Zhang
Large Language Model (LLM)-driven interactive systems currently show potential promise in healthcare domains.
no code implementations • 11 Oct 2024 • Shuai Liu, Ning Cao, Yile Chen, Yue Jiang, Gao Cong
Experiments show that NextLocLLM outperforms existing models in next location prediction, excelling in both supervised and zero-shot settings.
1 code implementation • 30 Sep 2024 • Yi-Hao Peng, Faria Huq, Yue Jiang, Jason Wu, Amanda Xin Yue Li, Jeffrey Bigham, Amy Pavel
Enabling machines to understand structured visuals like slides and user interfaces is essential for making them accessible to people with disabilities.
no code implementations • 26 Sep 2024 • Zheling Meng, Bo Peng, Xiaochuan Jin, Yue Jiang, Jing Dong, Wei Wang
Most existing methods focus on modifying the generation probabilities conditioned on the texts containing target concepts.
no code implementations • 22 Aug 2024 • Yile Chen, Weiming Huang, Kaiqi Zhao, Yue Jiang, Gao Cong
The proliferation of geospatial data in urban and territorial environments has significantly facilitated the development of geospatial artificial intelligence (GeoAI) across various urban applications.
no code implementations • 13 Aug 2024 • Stephen Ni-Hahn, Weihan Xu, Jerry Yin, Rico Zhu, Simon Mak, Yue Jiang, Cynthia Rudin
Schenkerian Analysis (SchA) is a uniquely expressive method of music analysis, combining elements of melody, harmony, counterpoint, and form to describe the hierarchical structure supporting a work of music.
no code implementations • 18 Jun 2024 • Yue Jiang, Qin Chao, Yile Chen, Xiucheng Li, Shuai Liu, Gao Cong
Location-based services play an critical role in improving the quality of our daily lives.
1 code implementation • 18 Jun 2024 • Yue Jiang, Xiucheng Li, Yile Chen, Shuai Liu, Weilong Kong, Antonis F. Lentzakis, Gao Cong
Time series forecasting is essential for our daily activities and precise modeling of the complex correlations and shared patterns among multiple time series is essential for improving forecasting performance.
no code implementations • 17 Jun 2024 • Yue Jiang, Jiawei Chen, Dingkang Yang, Mingcheng Li, Shunli Wang, Tong Wu, Ke Li, Lihua Zhang
Automatic medical report generation (MRG), which possesses significant research value as it can aid radiologists in clinical diagnosis and report composition, has garnered increasing attention.
no code implementations • 17 Jun 2024 • Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang
In this paper, we propose a two-stage framework named Diverse Legal Factor-enhanced Criminal Case Matching (DLF-CCM).
no code implementations • 14 Jun 2024 • Jiawei Chen, Dingkang Yang, Tong Wu, Yue Jiang, Xiaolu Hou, Mingcheng Li, Shunli Wang, Dongling Xiao, Ke Li, Lihua Zhang
To bridge this gap, we introduce Med-HallMark, the first benchmark specifically designed for hallucination detection and evaluation within the medical multimodal domain.
no code implementations • 7 Jun 2024 • Jie Zhao, Ziyu Guan, Cai Xu, Wei Zhao, Yue Jiang
We design a style consistency loss to ensure the generated multiple sentences consistently reflect the target style polarity.
1 code implementation • 29 May 2024 • Dingkang Yang, Jinjie Wei, Dongling Xiao, Shunli Wang, Tong Wu, Gang Li, Mingcheng Li, Shuaibing Wang, Jiawei Chen, Yue Jiang, Qingyao Xu, Ke Li, Peng Zhai, Lihua Zhang
In the parameter-efficient secondary SFT phase, a mixture of universal-specific experts strategy is presented to resolve the competency conflict between medical generalist and pediatric expertise mastery.
no code implementations • 14 May 2024 • Parvin Emami, Yue Jiang, Zixin Guo, Luis A. Leiva
We show that even small variations of these design parameters have a noticeable impact on standard evaluation metrics such as DTW or Eyenalysis.
no code implementations • 25 Apr 2024 • Jiawei Chen, Dingkang Yang, Yue Jiang, Mingcheng Li, Jinjie Wei, Xiaolu Hou, Lihua Zhang
In the realm of Medical Visual Language Models (Med-VLMs), the quest for universal efficient fine-tuning mechanisms remains paramount, especially given researchers in interdisciplinary fields are often extremely short of training resources, yet largely unexplored.
Medical Visual Question Answering
parameter-efficient fine-tuning
+2
no code implementations • 21 Apr 2024 • Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta
Present-day graphical user interfaces (GUIs) exhibit diverse arrangements of text, graphics, and interactive elements such as buttons and menus, but representations of GUIs have not kept up.
no code implementations • 15 Apr 2024 • Yue Jiang, Zixin Guo, Hamed Rezazadegan Tavakoli, Luis A. Leiva, Antti Oulasvirta
From a visual perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus.
2 code implementations • 11 Mar 2024 • Jiawei Chen, Yue Jiang, Dingkang Yang, Mingcheng Li, Jinjie Wei, Ziyun Qian, Lihua Zhang
In this paper, we delve into the fine-tuning methods of LLMs and conduct extensive experiments to investigate the impact of fine-tuning methods for large models on the existing multimodal model in the medical domain from the training data level and the model structure level.
1 code implementation • 10 Jan 2024 • Jiawei Chen, Dingkang Yang, Yue Jiang, Yuxuan Lei, Lihua Zhang
Medical visual question answering (VQA) is a challenging multimodal task, where Vision-Language Pre-training (VLP) models can effectively improve the generalization performance.
no code implementations • 8 Dec 2023 • Yue Jiang, Yueming Lyu, Tianxiang Ma, Bo Peng, Jing Dong
Extensive empirical evaluations demonstrate that the introduced \themodel effectively corrects the racial stereotypes of the well-trained Stable Diffusion model while leaving the original model unchanged.
1 code implementation • 7 Nov 2023 • Wenjun Peng, Guiyang Li, Yue Jiang, Zilong Wang, Dan Ou, Xiaoyi Zeng, Derong Xu, Tong Xu, Enhong Chen
In the realm of e-commerce search, the significance of semantic matching cannot be overstated, as it directly impacts both user experience and company revenue.
1 code implementation • 12 Oct 2023 • Yueming Lyu, Kang Zhao, Bo Peng, Yue Jiang, Yingya Zhang, Jing Dong
Based on DeltaSpace, we propose a novel framework called DeltaEdit, which maps the CLIP visual feature differences to the latent space directions of a generative model during the training phase, and predicts the latent space directions from the CLIP textual feature differences during the inference phase.
no code implementations • 7 Oct 2023 • Yue Jiang, Eldon Schoop, Amanda Swearngin, Jeffrey Nichols
Multimodal Vision-Language Models (VLMs) enable powerful applications from their fused understanding of images and language, but many perform poorly on UI tasks due to the lack of UI training data.
no code implementations • 3 Oct 2023 • Maryam Taeb, Amanda Swearngin, Eldon Schoop, Ruijia Cheng, Yue Jiang, Jeffrey Nichols
Recently, Large Language Models (LLMs) have been used for a variety of tasks including automation of UIs, however to our knowledge no one has yet explored their use in controlling assistive technologies for the purposes of supporting accessibility testing.
no code implementations • 29 Sep 2023 • Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang, Xiaofei He
Recent works considering professional legal-linguistic style (PLLS) texts have shown promising results on the charge prediction task.
no code implementations • 30 Jul 2023 • Yueming Lyu, Yue Jiang, Bo Peng, Jing Dong
InfoStyler formulates the disentanglement representation learning as an information compression problem by eliminating style statistics from the content image and removing the content structure from the style image.
no code implementations • 26 Jun 2023 • Yueming Lyu, Yue Jiang, Ziwen He, Bo Peng, Yunfan Liu, Jing Dong
The privacy and security of face data on social media are facing unprecedented challenges as it is vulnerable to unauthorized access and identification.
no code implementations • 11 Oct 2022 • Yue Jiang, Marc Habermann, Vladislav Golyanik, Christian Theobalt
Furthermore, we show that HiFECap outperforms the state-of-the-art human performance capture approaches qualitatively and quantitatively while for the first time capturing all aspects of the human.
1 code implementation • 30 Jun 2022 • Bo Peng, Wei Xiang, Yue Jiang, Wei Wang, Jing Dong, Zhenan Sun, Zhen Lei, Siwei Lyu
There is a two-party game between DeepFake creators and defenders.
no code implementations • 23 Feb 2022 • Yue Jiang, Wolfgang Stuerzlinger, Christof Lutteroth
Furthermore, it can be used to detect and fix problems in legacy UIs, extend UIs with enhanced layout behaviours, and support the creation of flexible UI layouts.
1 code implementation • 15 Feb 2022 • Meng Zhou, Xin Li, Yue Jiang, Lidong Bing
Prompting shows promising results in few-shot scenarios.
no code implementations • 15 Sep 2021 • Yunyao Zhu, Stephen Hahn, Simon Mak, Yue Jiang, Cynthia Rudin
Algorithmic harmonization - the automated harmonization of a musical piece given its melodic line - is a challenging problem that has garnered much interest from both music theorists and computer scientists.
1 code implementation • 23 Feb 2020 • Yue Jiang, Wolfgang Stuerzlinger, Matthias Zwicker, Christof Lutteroth
OR-constrained (ORC) graphical user interface layouts unify conventional constraint-based layouts with flow layouts, which enables the definition of flexible layouts that adapt to screens with different sizes, orientations, or aspect ratios with only a single layout specification.
no code implementations • 17 Dec 2019 • Yue Jiang, Ruofei Du, Christof Lutteroth, Wolfgang Stuerzlinger
We propose a novel approach for constraint-based graphical user interface (GUI) layout based on OR-constraints (ORC) in standard soft/hard linear constraint systems.
Human-Computer Interaction Graphics
1 code implementation • CVPR 2020 • Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker
We propose SDFDiff, a novel approach for image-based shape optimization using differentiable rendering of 3D shapes represented by signed distance functions (SDFs).
Ranked #1 on
Single-View 3D Reconstruction
on ShapeNet
(using extra training data)