Search Results for author: Yuheng Li

Found 22 papers, 11 papers with code

Bigger Data or Fairer Data? Augmenting BERT via Active Sampling for Educational Text Classification

1 code implementation COLING 2022 Lele Sha, Yuheng Li, Dragan Gasevic, Guanliang Chen

Pretrained Language Models (PLMs), though popular, have been diagnosed to encode bias against protected groups in the representations they learn, which may harm the prediction fairness of downstream models.

Fairness text-classification +1

Edit One for All: Interactive Batch Image Editing

no code implementations18 Jan 2024 Thao Nguyen, Utkarsh Ojha, Yuheng Li, Haotian Liu, Yong Jae Lee

With increased human control, it is now possible to edit an image in a plethora of ways; from specifying in text what we want to change, to straight up dragging the contents of the image in an interactive point-based manner.

Visual Instruction Inversion: Image Editing via Visual Prompting

1 code implementation26 Jul 2023 Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee

Given pairs of example that represent the "before" and "after" images of an edit, our goal is to learn a text-based editing direction that can be used to perform the same edit on new images.

Visual Prompting

Towards Automatic Boundary Detection for Human-AI Collaborative Hybrid Essay in Education

2 code implementations23 Jul 2023 Zijie Zeng, Lele Sha, Yuheng Li, Kaixun Yang, Dragan Gašević, Guanliang Chen

Then we proposed a two-step approach where we (1) separated AI-generated content from human-written content during the encoder training process; and (2) calculated the distances between every two adjacent prototypes and assumed that the boundaries exist between the two adjacent prototypes that have the furthest distance from each other.

Boundary Detection Text Detection

Generate Anything Anywhere in Any Scene

no code implementations29 Jun 2023 Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee

Text-to-image diffusion models have attracted considerable interest due to their wide applicability across diverse fields.

Data Augmentation Object

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

no code implementations9 Jun 2023 Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee

By leveraging the XML-based textual descriptions of SVG representations instead of raster images, we aim to bridge the gap between the visual and textual modalities, allowing LLMs to directly understand and manipulate images without the need for parameterized visual components.

Image Classification In-Context Learning +2

Synthetic CT Generation from MRI using 3D Transformer-based Denoising Diffusion Model

1 code implementation31 May 2023 Shaoyan Pan, Elham Abouei, Jacob Wynne, Tonghe Wang, Richard L. J. Qiu, Yuheng Li, Chih-Wei Chang, Junbo Peng, Justin Roper, Pretesh Patel, David S. Yu, Hui Mao, Xiaofeng Yang

The proposed model consists of two processes: a forward process which adds Gaussian noise to real CT scans, and a reverse process in which a shifted-window transformer V-net (Swin-Vnet) denoises the noisy CT scans conditioned on the MRI from the same patient to produce noise-free CT scans.

Anatomy Denoising +3

BreastSAM: A Study of Segment Anything Model for Breast Tumor Detection in Ultrasound Images

no code implementations21 May 2023 Mingzhe Hu, Yuheng Li, Xiaofeng Yang

We conducted a thorough investigation of the Segment Anything Model (SAM) for the task of interactive segmentation of breast tumors in ultrasound images.

Interactive Segmentation Segmentation +1

Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI

no code implementations30 Apr 2023 Yuheng Li, Jacob Wynne, Jing Wang, Richard L. J. Qiu, Justin Roper, Shaoyan Pan, Ashesh B. Jani, Tian Liu, Pretesh R. Patel, Hui Mao, Xiaofeng Yang

We introduce a novel end-to-end Cross-Shaped windows (CSwin) transformer UNet model, CSwin UNet, to detect clinically significant prostate cancer (csPCa) in prostate bi-parametric MR imaging (bpMRI) and demonstrate the effectiveness of our proposed self-supervised pre-training framework.

Self-Supervised Learning

Polyp-SAM: Transfer SAM for Polyp Segmentation

1 code implementation29 Apr 2023 Yuheng Li, Mingzhe Hu, Xiaofeng Yang

In this study, we propose Poly-SAM, a finetuned SAM model for polyp segmentation, and compare its performance to several state-of-the-art polyp segmentation models.

Image Segmentation Medical Image Segmentation +3

SkinSAM: Empowering Skin Cancer Segmentation with Segment Anything Model

no code implementations27 Apr 2023 Mingzhe Hu, Yuheng Li, Xiaofeng Yang

Skin cancer is a prevalent and potentially fatal disease that requires accurate and efficient diagnosis and treatment.

Image Segmentation Segmentation +2

Advancing Medical Imaging with Language Models: A Journey from N-grams to ChatGPT

no code implementations11 Apr 2023 Mingzhe Hu, Shaoyan Pan, Yuheng Li, Xiaofeng Yang

In this paper, we aimed to provide a review and tutorial for researchers in the field of medical imaging using language models to improve their tasks at hand.

Image Captioning Question Answering +1

Practical and Ethical Challenges of Large Language Models in Education: A Systematic Scoping Review

no code implementations17 Mar 2023 Lixiang Yan, Lele Sha, Linxuan Zhao, Yuheng Li, Roberto Martinez-Maldonado, Guanliang Chen, Xinyu Li, Yueqiao Jin, Dragan Gašević

Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content.

Question Generation Question-Generation

Towards Universal Fake Image Detectors that Generalize Across Generative Models

1 code implementation CVPR 2023 Utkarsh Ojha, Yuheng Li, Yong Jae Lee

In this work, we first show that the existing paradigm, which consists of training a deep network for real-vs-fake classification, fails to detect fake images from newer breeds of generative models when trained to detect GAN fake images.

Classification Language Modelling

GIRAFFE HD: A High-Resolution 3D-aware Generative Model

1 code implementation CVPR 2022 Yang Xue, Yuheng Li, Krishna Kumar Singh, Yong Jae Lee

3D-aware generative models have shown that the introduction of 3D information can lead to more controllable image generation.

Disentanglement Image Generation +2

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes

1 code implementation23 May 2021 Hao Huang, Yongtao Wang, Zhaoyu Chen, Yuze Zhang, Yuheng Li, Zhi Tang, Wei Chu, Jingdong Chen, Weisi Lin, Kai-Kuang Ma

Then, we design a two-level perturbation fusion strategy to alleviate the conflict between the adversarial watermarks generated by different facial images and models.

Adversarial Attack Face Swapping +1

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation

3 code implementations CVPR 2020 Yuheng Li, Krishna Kumar Singh, Utkarsh Ojha, Yong Jae Lee

We present MixNMatch, a conditional generative model that learns to disentangle and encode background, object pose, shape, and texture from real images with minimal supervision, for mix-and-match image generation.

Conditional Image Generation Disentanglement

Cannot find the paper you are looking for? You can Submit a new open access paper.