Search Results for author: Yichi Zhang

Found 95 papers, 58 papers with code

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

1 code implementation • 13 Nov 2023 • Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao

We present SPHINX, a versatile multi-modal large language model (MLLM) with a joint mixing of model weights, tuning tasks, and visual embeddings.

Ranked #2 on Visual Question Answering on BenchLMM

Described Object Detection Language Modelling +4

2,493

Paper
Code

Learning with Limited Annotations: A Survey on Deep Semi-Supervised Learning for Medical Image Segmentation

1 code implementation • 28 Jul 2022 • Rushi Jiao, Yichi Zhang, Le Ding, Rong Cai, Jicong Zhang

Medical image segmentation is a fundamental and critical step in many image-guided clinical approaches.

Image Segmentation Medical Image Segmentation +2

1,989

Paper
Code

Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models

1 code implementation • EACL 2021 • Qingyang Wu, Yichi Zhang, Yu Li, Zhou Yu

Existing dialog system models require extensive human annotations and are difficult to generalize to different tasks.

Language Modelling Response Generation

819

Paper
Code

Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions

1 code implementation • 7 Jan 2024 • Yichi Zhang, Zhenrong Shen, Rushi Jiao

Due to the inherent flexibility of prompting, foundation models have emerged as the predominant force in the fields of natural language processing and computer vision.

Benchmarking Image Segmentation +3

519

Paper
Code

Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

1 code implementation • 7 Feb 2024 • Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei LI

Mamba-UNet adopts a pure Visual Mamba (VMamba)-based encoder-decoder structure, infused with skip connections to preserve spatial information across different scales of the network.

Cardiac Segmentation Computational Efficiency +3

225

Paper
Code

AbdomenCT-1K: Is Abdominal Organ Segmentation A Solved Problem?

1 code implementation • 28 Oct 2020 • Jun Ma, Yao Zhang, Song Gu, Cheng Zhu, Cheng Ge, Yichi Zhang, Xingle An, Congcong Wang, Qiyuan Wang, Xin Liu, Shucheng Cao, Qi Zhang, Shangqing Liu, Yunpeng Wang, Yuhui Li, Jian He, Xiaoping Yang

With the unprecedented developments in deep learning, automatic segmentation of main abdominal organs seems to be a solved problem as state-of-the-art (SOTA) methods have achieved comparable results with inter-rater variability on many benchmark datasets.

Continual Learning Organ Segmentation +2

191

Paper
Code

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

3 code implementations • 8 Feb 2024 • Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

In this survey, we carefully review over 300 articles, focusing on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, where KGs support multi-modal tasks, and Multi-Modal Knowledge Graph (MM4KG), which extends KG studies into the MMKG realm.

Entity Alignment Image Classification +4

186

Paper
Code

PokeBNN: A Binary Pursuit of Lightweight Accuracy

1 code implementation • CVPR 2022 • Yichi Zhang, Zhiru Zhang, Lukasz Lew

In order to enable joint optimization of the cost together with accuracy, we define arithmetic computation effort (ACE), a hardware- and energy-inspired cost metric for quantized and binarized networks.

Ranked #1 on Binarization on ImageNet

Binarization

169

Paper
Code

Binarized Neural Machine Translation

1 code implementation • NeurIPS 2023 • Yichi Zhang, Ankush Garg, Yuan Cao, Łukasz Lew, Behrooz Ghorbani, Zhiru Zhang, Orhan Firat

In this work, we propose a novel binarization technique for Transformers applied to machine translation (BMT), the first of its kind.

Binarization Machine Translation +2

169

Paper
Code

MyGO: Discrete Modality Information as Fine-Grained Tokens for Multi-modal Knowledge Graph Completion

2 code implementations • 15 Apr 2024 • Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Huajun Chen, Wen Zhang

To overcome their inherent incompleteness, multi-modal knowledge graph completion (MMKGC) aims to discover unobserved knowledge from given MMKGs, leveraging both structural information from the triples and multi-modal information of the entities.

Contrastive Learning Descriptive +3

166

Paper
Code

Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications

1 code implementation • 15 Nov 2022 • Zhongkai Hao, Songming Liu, Yichi Zhang, Chengyang Ying, Yao Feng, Hang Su, Jun Zhu

Recent work shows that it provides potential benefits for machine learning models by incorporating the physical prior and collected data, which makes the intersection of machine learning and physics become a prevailing paradigm.

Physics-informed machine learning

136

Paper
Code

PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs

1 code implementation • 15 Jun 2023 • Zhongkai Hao, Jiachen Yao, Chang Su, Hang Su, Ziao Wang, Fanzhi Lu, Zeyu Xia, Yichi Zhang, Songming Liu, Lu Lu, Jun Zhu

In addition to providing a standardized means of assessing performance, PINNacle also offers an in-depth analysis to guide future research, particularly in areas such as domain decomposition methods and loss reweighting for handling multi-scale problems and complex geometry.

Benchmarking

136

Paper
Code

Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

1 code implementation • 11 Nov 2023 • Yichi Zhang, Zhuo Chen, Yin Fang, Yanxi Lu, Fangming Li, Wen Zhang, Huajun Chen

Deploying large language models (LLMs) to real scenarios for domain-specific question answering (QA) is a key thrust for LLM applications, which poses numerous challenges, especially in ensuring that responses are both accommodating to user requirements and appropriately leveraging domain-specific knowledge bases.

Knowledge Graphs Question Answering

130

Paper
Code

Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context

6 code implementations • 24 Nov 2019 • Yichi Zhang, Zhijian Ou, Zhou Yu

Conversations have an intrinsic one-to-many property, which means that multiple responses can be appropriate for the same dialog context.

Ranked #6 on End-To-End Dialogue Modelling on MULTIWOZ 2.0

Data Augmentation End-To-End Dialogue Modelling +1

Paper
Code

Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond

1 code implementation • 3 Oct 2023 • Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang

In this study, we explore the potential of Multimodal Large Language Models (MLLMs) in improving embodied decision-making processes for agents.

Decision Making Language Modelling +2

Paper
Code

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

1 code implementation • 21 Feb 2024 • Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang

To address this, we introduce Embodied-Instruction-Evolution (EIE), an automatic framework for synthesizing instruction tuning examples in multimodal embodied environments.

Autonomous Driving Decision Making

Paper
Code

Making Large Language Models Perform Better in Knowledge Graph Completion

1 code implementation • 10 Oct 2023 • Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Wen Zhang, Huajun Chen

In this paper, we explore methods to incorporate structural information into the LLMs, with the overarching goal of facilitating structure-aware reasoning.

In-Context Learning Knowledge Graph Completion +2

Paper
Code

Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining

1 code implementation • ICCV 2021 • Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang

In this paper, we investigate a more realistic setting that aims to perform weakly-supervised multi-modal instance-level product retrieval among fine-grained product categories.

Retrieval

Paper
Code

How Robust is Google's Bard to Adversarial Image Attacks?

1 code implementation • 21 Sep 2023 • Yinpeng Dong, Huanran Chen, Jiawei Chen, Zhengwei Fang, Xiao Yang, Yichi Zhang, Yu Tian, Hang Su, Jun Zhu

By attacking white-box surrogate vision encoders or MLLMs, the generated adversarial examples can mislead Bard to output wrong image descriptions with a 22% success rate based solely on the transferability.

Adversarial Robustness Chatbot +1

Paper
Code

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations

2 code implementations • 22 Dec 2020 • Yichi Zhang, Junhao Pan, Xinheng Liu, Hongzheng Chen, Deming Chen, Zhiru Zhang

We design an efficient FPGA-based accelerator for our novel BNN model that supports the fractional activations.

Binarization Image Classification

Paper
Code

Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations

1 code implementation • ICLR 2020 • Yichi Zhang, Ritchie Zhao, Weizhe Hua, Nayun Xu, G. Edward Suh, Zhiru Zhang

The proposed approach is applicable to a variety of DNN architectures and significantly reduces the computational cost of DNN execution with almost no accuracy loss.

Quantization

Paper
Code

Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph

1 code implementation • EMNLP 2020 • Xin Lv, Xu Han, Lei Hou, Juanzi Li, Zhiyuan Liu, Wei zhang, Yichi Zhang, Hao Kong, Suhui Wu

On the one hand, sparse KGs contain less information, which makes it difficult for the model to choose correct paths.

Paper
Code

MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid

1 code implementation • 29 Dec 2022 • Zhuo Chen, Jiaoyan Chen, Wen Zhang, Lingbing Guo, Yin Fang, Yufeng Huang, Yichi Zhang, Yuxia Geng, Jeff Z. Pan, Wenting Song, Huajun Chen

Multi-modal entity alignment (MMEA) aims to discover identical entities across different knowledge graphs (KGs) whose entities are associated with relevant images.

Ranked #1 on Entity Alignment on FBYG15k (using extra training data)

Knowledge Graphs Multi-modal Entity Alignment

Paper
Code

Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection

1 code implementation • 25 Oct 2019 • Shumin Deng, Ningyu Zhang, Jiaojian Kang, Yichi Zhang, Wei zhang, Huajun Chen

Differing from vanilla prototypical networks simply computing event prototypes by averaging, which only consume event mentions once, our model is more robust and is capable of distilling contextual information from event mentions for multiple times due to the multi-hop mechanism of DMNs.

Event Detection Event Extraction +2

Paper
Code

Rethinking Model Ensemble in Transfer-based Adversarial Attacks

2 code implementations • 16 Mar 2023 • Huanran Chen, Yichi Zhang, Yinpeng Dong, Xiao Yang, Hang Su, Jun Zhu

It is widely recognized that deep learning models lack robustness to adversarial examples.

Image Classification Language Modelling +2

Paper
Code

Autonomous Evaluation and Refinement of Digital Agents

1 code implementation • 9 Apr 2024 • Jiayi Pan, Yichi Zhang, Nicholas Tomlin, Yifei Zhou, Sergey Levine, Alane Suhr

We show that domain-general automatic evaluators can significantly improve the performance of agents for web navigation and device control.

Paper
Code

ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks

1 code implementation • 16 Nov 2023 • Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Liang Chen, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein

While Large Language Models (LLMs) have demonstrated proficiency in code generation benchmarks, translating these results into practical development scenarios - where leveraging existing repository-level libraries is the norm - remains challenging.

Code Generation Navigate

Paper
Code

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

1 code implementation • 19 Mar 2021 • Zhe Xie, Chengxuan Liu, Yichi Zhang, Hongtao Lu, Dong Wang, Yue Ding

To solve the above problem, in this work, we propose a novel method called Adversarial and Contrastive Variational Autoencoder (ACVAE) for sequential recommendation.

Collaborative Filtering Sequential Recommendation

Paper
Code

Dual-Task Mutual Learning for Semi-Supervised Medical Image Segmentation

1 code implementation • 8 Mar 2021 • Yichi Zhang, Jicong Zhang

The success of deep learning methods in medical image segmentation tasks usually requires a large amount of labeled data.

Image Segmentation Segmentation +2

Paper
Code

Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring

1 code implementation • Findings (ACL) 2021 • Yichi Zhang, Joyce Chai

On the ALFRED benchmark for task learning, the published state-of-the-art system only achieves a task success rate of less than 10% in an unseen environment, compared to the human performance of over 90%.

Paper
Code

Tele-Knowledge Pre-training for Fault Analysis

1 code implementation • 20 Oct 2022 • Zhuo Chen, Wen Zhang, Yufeng Huang, Mingyang Chen, Yuxia Geng, Hongtao Yu, Zhen Bi, Yichi Zhang, Zhen Yao, Wenting Song, Xinliang Wu, Yi Yang, Mingyi Chen, Zhaoyang Lian, YingYing Li, Lei Cheng, Huajun Chen

In this work, we share our experience on tele-knowledge pre-training for fault analysis, a crucial task in telecommunication applications that requires a wide range of knowledge normally found in both machine log data and product documents.

Language Modelling

Paper
Code

Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

1 code implementation • 30 Jul 2023 • Zhuo Chen, Lingbing Guo, Yin Fang, Yichi Zhang, Jiaoyan Chen, Jeff Z. Pan, Yangning Li, Huajun Chen, Wen Zhang

As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information.

Ranked #1 on Multi-modal Entity Alignment on UMVM-oea-d-w-v2 (using extra training data)

Benchmarking Knowledge Graph Embeddings +2

Paper
Code

A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

1 code implementation • EMNLP 2020 • Yichi Zhang, Zhijian Ou, Huixin Wang, Junlan Feng

In this paper we aim at alleviating the reliance on belief state labels in building end-to-end dialog systems, by leveraging unlabeled dialog data towards semi-supervised learning.

Ranked #2 on End-To-End Dialogue Modelling on MULTIWOZ 2.1

End-To-End Dialogue Modelling

Paper
Code

Unrestricted Adversarial Attacks on ImageNet Competition

1 code implementation • 17 Oct 2021 • Yuefeng Chen, Xiaofeng Mao, Yuan He, Hui Xue, Chao Li, Yinpeng Dong, Qi-An Fu, Xiao Yang, Tianyu Pang, Hang Su, Jun Zhu, Fangcheng Liu, Chao Zhang, Hongyang Zhang, Yichi Zhang, Shilong Liu, Chang Liu, Wenzhao Xiang, Yajie Wang, Huipeng Zhou, Haoran Lyu, Yidan Xu, Zixuan Xu, Taoyu Zhu, Wenjun Li, Xianfeng Gao, Guoqiu Wang, Huanqian Yan, Ying Guo, Chaoning Zhang, Zheng Fang, Yang Wang, Bingyang Fu, Yunfei Zheng, Yekui Wang, Haorong Luo, Zhen Yang

Many works have investigated the adversarial attacks or defenses under the settings where a bounded and imperceptible perturbation can be added to the input.

Adversarial Attack

Paper
Code

Logic Synthesis Meets Machine Learning: Trading Exactness for Generalization

1 code implementation • 4 Dec 2020 • Shubham Rai, Walter Lau Neto, Yukio Miyasaka, Xinpei Zhang, Mingfei Yu, Qingyang Yi Masahiro Fujita, Guilherme B. Manske, Matheus F. Pontes, Leomar S. da Rosa Junior, Marilton S. de Aguiar, Paulo F. Butzen, Po-Chun Chien, Yu-Shan Huang, Hoa-Ren Wang, Jie-Hong R. Jiang, Jiaqi Gu, Zheng Zhao, Zixuan Jiang, David Z. Pan, Brunno A. de Abreu, Isac de Souza Campos, Augusto Berndt, Cristina Meinhardt, Jonata T. Carvalho, Mateus Grellert, Sergio Bampi, Aditya Lohana, Akash Kumar, Wei Zeng, Azadeh Davoodi, Rasit O. Topaloglu, Yuan Zhou, Jordan Dotzel, Yichi Zhang, Hanyu Wang, Zhiru Zhang, Valerio Tenace, Pierre-Emmanuel Gaillardon, Alan Mishchenko, Satrajit Chatterjee

If the function is incompletely-specified, the implementation has to be true only on the care set.

BIG-bench Machine Learning

Paper
Code

Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability

1 code implementation • EMNLP 2021 • Xin Lv, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Yichi Zhang, Zelin Dai

However, we find in experiments that many paths given by these models are actually unreasonable, while little works have been done on interpretability evaluation for them.

Benchmarking Link Prediction

Paper
Code

DANLI: Deliberative Agent for Following Natural Language Instructions

1 code implementation • 22 Oct 2022 • Yichi Zhang, Jianing Yang, Jiayi Pan, Shane Storks, Nikhil Devraj, Ziqiao Ma, Keunwoo Peter Yu, Yuwei Bao, Joyce Chai

These reactive agents are insufficient for long-horizon complex tasks.

Instruction Following Vision-Language Navigation

Paper
Code

Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

1 code implementation • CVPR 2023 • Zijian Zhu, Yichi Zhang, Hai Chen, Yinpeng Dong, Shu Zhao, Wenbo Ding, Jiachen Zhong, Shibao Zheng

However, there still lacks a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems.

3D Object Detection Adversarial Robustness +2

Paper
Code

Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding

1 code implementation • Findings (EMNLP) 2021 • Shane Storks, Qiaozi Gao, Yichi Zhang, Joyce Chai

However, evaluations only based on end task performance shed little light on machines' true ability in language understanding and reasoning.

valid

Paper
Code

Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion

1 code implementation • 22 Feb 2024 • Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen, Wen Zhang

To address the mentioned problems, we propose Adaptive Multi-modal Fusion and Modality Adversarial Training (AdaMF-MAT) to unleash the power of imbalanced modality information for MMKGC.

Multi-modal Knowledge Graph

Paper
Code

Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs

1 code implementation • 1 Apr 2024 • Xiaoze Liu, Feijie Wu, Tianyang Xu, Zhuo Chen, Yichi Zhang, Xiaoqian Wang, Jing Gao

In this paper, we propose GraphEval to evaluate an LLM's performance using a substantially large test dataset.

Knowledge Graphs

Paper
Code

CausE: Towards Causal Knowledge Graph Embedding

1 code implementation • 21 Jul 2023 • Yichi Zhang, Wen Zhang

Knowledge graph embedding (KGE) focuses on representing the entities and relations of a knowledge graph (KG) into the continuous vector spaces, which can be employed to predict the missing triples to achieve knowledge graph completion (KGC).

Disentanglement Knowledge Graph Completion +1

Paper
Code

Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework

1 code implementation • 28 Oct 2021 • Lifan Yuan, Yichi Zhang, Yangyi Chen, Wei Wei

In this paper, we instantiate our framework with an attack algorithm named Textual Projected Gradient Descent (T-PGD).

Adversarial Attack Language Modelling

Paper
Code

Understanding Hyperdimensional Computing for Parallel Single-Pass Learning

1 code implementation • 10 Feb 2022 • Tao Yu, Yichi Zhang, Zhiru Zhang, Christopher De Sa

Using representation theory, we characterize which similarity matrices can be "expressed" by finite group VSA hypervectors, and we show how these VSAs can be constructed.

Paper
Code

Diffusion Noise Feature: Accurate and Fast Generated Image Detection

1 code implementation • 5 Dec 2023 • Yichi Zhang, Xiaogang Xu

DNF is extracted from the estimated noise generated during the inverse diffusion process.

Paper
Code

The Power of Noise: Toward a Unified Multi-modal Knowledge Graph Representation Framework

1 code implementation • 11 Mar 2024 • Zhuo Chen, Yin Fang, Yichi Zhang, Lingbing Guo, Jiaoyan Chen, Huajun Chen, Wen Zhang

In this work, to evaluate models' ability to accurately embed entities within MMKGs, we focus on two widely researched tasks: Multi-modal Knowledge Graph Completion (MKGC) and Multi-modal Entity Alignment (MMEA).

Knowledge Graph Completion Misconceptions +3

Paper
Code

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

1 code implementation • 17 Apr 2024 • Yichi Zhang, Yinpeng Dong, Siyuan Zhang, Tianzan Min, Hang Su, Jun Zhu

To achieve this, we propose Transferable Visual Prompting (TVP), a simple and effective approach to generate visual prompts that can transfer to different models and improve their performance on downstream tasks after trained on only one model.

Hallucination Multimodal Reasoning +2

Paper
Code

Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding

1 code implementation • 23 Apr 2023 • Yichi Zhang, Mingyang Chen, Wen Zhang

Negative sampling (NS) is widely used in knowledge graph embedding (KGE), which aims to generate negative triples to make a positive-negative contrast during training.

Knowledge Graph Embedding Multi-modal Knowledge Graph

Paper
Code

Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?

1 code implementation • 31 Oct 2023 • Yichi Zhang, Jiayi Pan, Yuchen Zhou, Rui Pan, Joyce Chai

Vision-Language Models (VLMs) are trained on vast amounts of data captured by humans emulating our understanding of the world.

Paper
Code

Paraphrase Augmented Task-Oriented Dialog Generation

1 code implementation • ACL 2020 • Silin Gao, Yichi Zhang, Zhijian Ou, Zhou Yu

Neural generative models have achieved promising performance on dialog generation tasks if given a huge data set.

Data Augmentation Response Generation

Paper
Code

Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making

1 code implementation • ACL 2021 • Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li, Yichi Zhang, Zelin Dai

Using a set of comparison features and a limited amount of annotated data, KAT Induction learns an efficient decision tree that can be interpreted by generating entity matching rules whose structure is advocated by domain experts.

Attribute Decision Making +2

Paper
Code

D-Net: Dynamic Large Kernel with Dynamic Feature Fusion for Volumetric Medical Image Segmentation

1 code implementation • 15 Mar 2024 • Jin Yang, Peijie Qiu, Yichi Zhang, Daniel S. Marcus, Aristeidis Sotiras

D-Net is able to effectively utilize a multi-scale large receptive field and adaptively harness global contextual information.

Brain Tumor Segmentation Image Segmentation +4

Paper
Code

Face Alignment Assisted by Head Pose Estimation

1 code implementation • 11 Jul 2015 • Heng Yang, Wenxuan Mou, Yichi Zhang, Ioannis Patras, Hatice Gunes, Peter Robinson

In this paper we propose a supervised initialization scheme for cascaded face alignment based on explicit head pose estimation.

Face Alignment Head Pose Estimation

Paper
Code

MACO: A Modality Adversarial and Contrastive Framework for Modality-missing Multi-modal Knowledge Graph Completion

1 code implementation • 13 Aug 2023 • Yichi Zhang, Zhuo Chen, Wen Zhang

Nevertheless, existing methods emphasize the design of elegant KGC models to facilitate modality interaction, neglecting the real-life problem of missing modalities in KGs.

Multi-modal Knowledge Graph

Paper
Code

Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?

1 code implementation • 1 Nov 2023 • Yuwei Bao, Keunwoo Peter Yu, Yichi Zhang, Shane Storks, Itamar Bar-Yossef, Alexander De La Iglesia, Megan Su, Xiao Lin Zheng, Joyce Chai

Despite tremendous advances in AI, it remains a significant challenge to develop interactive task guidance systems that can offer situated, personalized guidance and assist humans in various tasks.

Decision Making

Paper
Code

U-Net-and-a-half: Convolutional network for biomedical image segmentation using multiple expert-driven annotations

1 code implementation • 10 Aug 2021 • Yichi Zhang, Jesper Kers, Clarissa A. Cassol, Joris J. Roelofs, Najia Idrees, Alik Farber, Samir Haroon, Kevin P. Daly, Suvranu Ganguli, Vipul C. Chitalia, Vijaya B. Kolachalama

If more than a single expert is involved in the annotation of the same images, then the inter-expert agreement is not necessarily perfect, and no single expert annotation can precisely capture the so-called ground truth of the regions of interest on all images.

Image Segmentation Semantic Segmentation +1

Paper
Code

A Latent Variable Approach to Gaussian Process Modeling with Qualitative and Quantitative Factors

1 code implementation • 19 Jun 2018 • Yichi Zhang, Siyu Tao, Wei Chen, Daniel W. Apley

Computer simulations often involve both qualitative and numerical inputs.

Dimensionality Reduction

Paper
Code

Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning

no code implementations • ICLR 2018 • Yichi Zhang, Zhijian Ou

An ensemble of neural networks is known to be more robust and accurate than an individual network, however usually with linearly-increased cost in both training and testing.

Language Modelling Network Pruning

Paper
Add Code

Elastic CRFs for Open-ontology Slot Filling

no code implementations • 4 Nov 2018 • Yinpei Dai, Yichi Zhang, Zhijian Ou, Yanmeng Wang, Junlan Feng

Second, the one-hot encoding of slot labels ignores the semantic meanings and relations for slots, which are implicit in their natural language descriptions.

slot-filling Slot Filling

Paper
Add Code

Data-Centric Mixed-Variable Bayesian Optimization For Materials Design

no code implementations • 4 Jul 2019 • Akshay Iyer, Yichi Zhang, Aditya Prasad, Siyu Tao, Yixing Wang, Linda Schadler, L Catherine Brinson, Wei Chen

To this end, we present a data-centric, mixed-variable Bayesian Optimization framework that integrates data from literature, experiments, and simulations for knowledge discovery and computational materials design.

Bayesian Optimization Navigate

Paper
Add Code

Bayesian Optimization for Materials Design with Mixed Quantitative and Qualitative Variables

no code implementations • 3 Oct 2019 • Yichi Zhang, Daniel Apley, Wei Chen

We present in this paper the integration of a novel latent-variable (LV) approach for mixed-variable GP modeling with the BO framework for materials design.

Bayesian Optimization

Paper
Add Code

Covariance Estimation for Matrix-valued Data

no code implementations • 11 Apr 2020 • Yichi Zhang, Weining Shen, Dehan Kong

Covariance estimation for matrix-valued data has received an increasing interest in applications.

Paper
Add Code

Bridging 2D and 3D Segmentation Networks for Computation Efficient Volumetric Medical Image Segmentation: An Empirical Study of 2.5D Solutions

no code implementations • 13 Oct 2020 • Yichi Zhang, Qingcheng Liao, Le Ding, Jicong Zhang

Despite these works lead to improvements on a variety of segmentation tasks, to the best of our knowledge, there has not previously been a large-scale empirical comparison of these methods.

Image Segmentation Segmentation +2

Paper
Add Code

Distributed Representations of Entities in Open-World Knowledge Graphs

no code implementations • 16 Oct 2020 • Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Yichi Zhang, Zequn Sun, Zhongpo Bo, Yin Fang, Xiaoze Liu, Huajun Chen, Wen Zhang

DAN leverages neighbor context as the query vector to score the neighbors of an entity, thereby distributing the entity semantics only among its neighbor embeddings.

Entity Alignment Graph Representation Learning +2

Paper
Add Code

Cascaded Convolutional Neural Network for Automatic Myocardial Infarction Segmentation from Delayed-Enhancement Cardiac MRI

no code implementations • 28 Dec 2020 • Yichi Zhang

Automatic segmentation of myocardial contours and relevant areas like infraction and no-reflow is an important step for the quantitative evaluation of myocardial infarction.

Segmentation

Paper
Add Code

Exploiting Shared Knowledge from Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation

no code implementations • 31 Dec 2020 • Yichi Zhang, Qingcheng Liao, Lin Yuan, He Zhu, Jiezhen Xing, Jicong Zhang

In this paper, we propose a novel relation-driven collaborative learning model to exploit shared knowledge from non-COVID lesions for annotation-efficient COVID-19 CT lung infection segmentation.

Computed Tomography (CT) Lesion Segmentation +1

Paper
Add Code

Exact Recovery of Community Structures Using DeepWalk and Node2vec

no code implementations • 18 Jan 2021 • Yichi Zhang, Minh Tang

Random-walk based network embedding algorithms like DeepWalk and node2vec are widely used to obtain Euclidean representation of the nodes in a network prior to performing downstream inference tasks.

Clustering Community Detection +1

Paper
Add Code

Improving Conversational Recommendation System by Pretraining on Billions Scale of Knowledge Graph

no code implementations • 30 Apr 2021 • Chi-Man Wong, Fan Feng, Wen Zhang, Chi-Man Vong, Hui Chen, Yichi Zhang, Peng He, Huan Chen, Kun Zhao, Huajun Chen

We first construct a billion-scale conversation knowledge graph (CKG) from information about users, items and conversations, and then pretrain CKG by introducing knowledge graph embedding method and graph convolution network to encode semantic and structural information respectively. To make the CTR prediction model sensible of current state of users and the relationship between dialogues and items, we introduce user-state and dialogue-interaction representations based on pre-trained CKG and propose K-DCN. In K-DCN, we fuse the user-state representation, dialogue-interaction representation and other normal feature representations via deep cross network, which will give the rank of candidate items to be recommended. We experimentally prove that our proposal significantly outperforms baselines and show it's real application in Alime.

Click-Through Rate Prediction Knowledge Graph Embedding +1

Paper
Add Code

Deep Learning methods for automatic evaluation of delayed enhancement-MRI. The results of the EMIDEC challenge

no code implementations • 9 Aug 2021 • Alain Lalande, Zhihao Chen, Thibaut Pommier, Thomas Decourselle, Abdul Qayyum, Michel Salomon, Dominique Ginhac, Youssef Skandarani, Arnaud Boucher, Khawla Brahim, Marleen de Bruijne, Robin Camarasa, Teresa M. Correia, Xue Feng, Kibrom B. Girum, Anja Hennemuth, Markus Huellebrand, Raabid Hussain, Matthias Ivantsits, Jun Ma, Craig Meyer, Rishabh Sharma, Jixi Shi, Nikolaos V. Tsekos, Marta Varela, Xiyue Wang, Sen yang, Hannu Zhang, Yichi Zhang, Yuncheng Zhou, Xiahai Zhuang, Raphael Couturier, Fabrice Meriaudeau

The publicly available database consists of 150 exams divided into 50 cases with normal MRI after injection of a contrast agent and 100 cases with myocardial infarction (and then with a hyperenhanced area on DE-MRI), whatever their inclusion in the cardiac emergency department.

Paper
Add Code

BulletTrain: Accelerating Robust Neural Network Training via Boundary Example Mining

no code implementations • NeurIPS 2021 • Weizhe Hua, Yichi Zhang, Chuan Guo, Zhiru Zhang, G. Edward Suh

Neural network robustness has become a central topic in machine learning in recent years.

Paper
Add Code

Adversarial Semantic Contour for Object Detection

no code implementations • ICML Workshop AML 2021 • Yichi Zhang, Zijian Zhu, Xiao Yang, Jun Zhu

To address this issue, we propose a novel method of Adversarial Semantic Contour (ASC) guided by object contour as prior.

Object object-detection +1

Paper
Add Code

SAU-Net: Efficient 3D Spine MRI Segmentation Using Inter-Slice Attention

no code implementations • MIDL 2019 • Yichi Zhang, Lin Yuan, Yujia Wang, Jicong Zhang

Accurate segmentation of spine Magnetic Resonance Imaging (MRI) is highly demanded in morphological research, quantitative analysis, and diseases identification, such as spinal canal stenosis, disc herniation and degeneration.

MRI segmentation Segmentation

Paper
Add Code

Uncertainty-Guided Mutual Consistency Learning for Semi-Supervised Medical Image Segmentation

no code implementations • 5 Dec 2021 • Yichi Zhang, Rushi Jiao, Qingcheng Liao, Dongyang Li, Jicong Zhang

In this paper, we propose a novel uncertainty-guided mutual consistency learning framework to effectively exploit unlabeled data by integrating intra-task consistency learning from up-to-date predictions for self-ensembling and cross-task consistency learning from task-level regularization to exploit geometric shape information.

Brain Tumor Segmentation Image Segmentation +4

Paper
Add Code

Perturbation Analysis of Randomized SVD and its Applications to High-dimensional Statistics

no code implementations • 19 Mar 2022 • Yichi Zhang, Minh Tang

We first derive upper bounds for the $\ell_2$ (spectral norm) and $\ell_{2\to\infty}$ (maximum row-wise $\ell_2$ norm) distances between the approximate singular vectors of $\hat{\mathbf{M}}$ and the true singular vectors of the signal matrix $\mathbf{M}$.

Community Detection Matrix Completion

Paper
Add Code

Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling

no code implementations • 15 Sep 2022 • Yichi Zhang, Wen Zhang

Twins negative sampling is suitable for multimodal scenarios and could align different embeddings for entities.

Link Prediction World Knowledge

Paper
Add Code

To Make Yourself Invisible with Adversarial Semantic Contours

no code implementations • 1 Mar 2023 • Yichi Zhang, Zijian Zhu, Hang Su, Jun Zhu, Shibao Zheng, Yuan He, Hui Xue

In this paper, we propose Adversarial Semantic Contour (ASC), an MAP estimate of a Bayesian formulation of sparse attack with a deceived prior of object contour.

Autonomous Driving Object +2

Paper
Add Code

DA-VEGAN: Differentiably Augmenting VAE-GAN for microstructure reconstruction from extremely small data sets

no code implementations • 17 Feb 2023 • Yichi Zhang, Paul Seibert, Alexandra Otto, Alexander Raßloff, Marreddy Ambati, Markus Kästner

Microstructure reconstruction is an important and emerging field of research and an essential foundation to improving inverse computational materials engineering (ICME).

Data Augmentation

Paper
Add Code

Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning

no code implementations • 20 Apr 2023 • Chenglu Sun, Yichi Zhang, Yu Zhang, Ziling Lu, Jingbin Liu, Sijia Xu, Weidong Zhang

We propose asymmetric-evolution training (AET), a novel multi-agent reinforcement learning framework that can train multiple kinds of agents simultaneously in AMP game.

Multi-agent Reinforcement Learning reinforcement-learning

Paper
Add Code

Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations

no code implementations • 3 May 2023 • Vasudha Kowtha, Miquel Espi Marques, Jonathan Huang, Yichi Zhang, Carlos Avendano

This work investigates pretrained audio representations for few shot Sound Event Detection.

Event Detection Few-Shot Learning +1

Paper
Add Code

Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey

no code implementations • 5 May 2023 • Yichi Zhang, Rushi Jiao

Due to the flexibility of prompting, foundation models have become the dominant force in the domains of natural language processing and image generation.

Benchmarking Image Generation +4

Paper
Add Code

Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models

no code implementations • 11 May 2023 • Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

In multivariate time series systems, key insights can be obtained by discovering lead-lag relationships inherent in the data, which refer to the dependence between two time series shifted in time relative to one another, and which can be leveraged for the purposes of control, forecasting or clustering.

Clustering Time Series

Paper
Add Code

Dynamic Time Warping for Lead-Lag Relationships in Lagged Multi-Factor Models

no code implementations • 15 Sep 2023 • Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

In multivariate time series systems, lead-lag relationships reveal dependencies between time series when they are shifted in time relative to each other.

Dynamic Time Warping Time Series

Paper
Add Code

Competitive Ensembling Teacher-Student Framework for Semi-Supervised Left Atrium MRI Segmentation

no code implementations • 21 Oct 2023 • Yuyan Shi, Yichi Zhang, Shasha Wang

Semi-supervised learning has greatly advanced medical image segmentation since it effectively alleviates the need of acquiring abundant annotations from experts and utilizes unlabeled data which is much easier to acquire.

Image Segmentation Left Atrium Segmentation +4

Paper
Add Code

Enhancing the Reliability of Segment Anything Model for Auto-Prompting Medical Image Segmentation with Uncertainty Rectification

no code implementations • 17 Nov 2023 • Yichi Zhang, Shiyao Hu, Sijie Ren, Chen Jiang, Yuan Cheng, Yuan Qi

The Segment Anything Model (SAM) has recently emerged as a groundbreaking foundation model for prompt-driven image segmentation tasks.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

SemiSAM: Exploring SAM for Enhancing Semi-Supervised Medical Image Segmentation with Extremely Limited Annotations

no code implementations • 11 Dec 2023 • Yichi Zhang, Yuan Cheng, Yuan Qi

Semi-supervised learning has attracted much attention due to its less dependence on acquiring abundant annotations from experts compared to fully supervised methods, which is especially important for medical image segmentation which typically requires intensive pixel/voxel-wise labeling by domain experts.

Image Segmentation Segmentation +2

Paper
Add Code

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

no code implementations • 21 Jan 2024 • Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma

While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image.

Clustering Image Compression +3

Paper
Add Code

Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs

no code implementations • 31 Jan 2024 • Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang

Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs.

Quantization

Paper
Add Code

Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science

no code implementations • 6 Feb 2024 • Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein

Intelligent agents powered by large language models (LLMs) have demonstrated substantial promise in autonomously conducting experiments and facilitating scientific discoveries across various disciplines.

Paper
Add Code

UMOEA/D: A Multiobjective Evolutionary Algorithm for Uniform Pareto Objectives based on Decomposition

no code implementations • 14 Feb 2024 • Xiaoyuan Zhang, Xi Lin, Yichi Zhang, Yifan Chen, Qingfu Zhang

Multiobjective optimization (MOO) is prevalent in numerous applications, in which a Pareto front (PF) is constructed to display optima under various preferences.

Multiobjective Optimization

Paper
Add Code

Spot Check Equivalence: an Interpretable Metric for Information Elicitation Mechanisms

no code implementations • 21 Feb 2024 • Shengwei Xu, Yichi Zhang, Paul Resnick, Grant Schoenebeck

However, different metrics lead to divergent and even contradictory results in various contexts.

Paper
Add Code

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

no code implementations • 26 Feb 2024 • Yichi Zhang, Ziqiao Ma, Xiaofeng Gao, Suhaila Shakiah, Qiaozi Gao, Joyce Chai

Most multimodal large language models (MLLMs) learn language-to-object grounding through causal language modeling where grounded objects are captured by bounding boxes as sequences of location tokens.

Ranked #1 on Generalized Referring Expression Segmentation on gRefCOCO (using extra training data)

Causal Language Modeling Generalized Referring Expression Segmentation +2

Paper
Add Code

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

no code implementations • 21 Mar 2024 • Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li

To this end, we introduce MathVerse, an all-around visual math benchmark designed for an equitable and in-depth evaluation of MLLMs.

Math Mathematical Reasoning

Paper
Add Code

Theoretical Bound-Guided Hierarchical VAE for Neural Image Codecs

1 code implementation • 27 Mar 2024 • Yichi Zhang, Zhihao Duan, Yuning Huang, Fengqing Zhu

Recent studies reveal a significant theoretical link between variational autoencoders (VAEs) and rate-distortion theory, notably in utilizing VAEs to estimate the theoretical upper bound of the information rate-distortion function of images.

Paper
Code

Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

no code implementations • 22 Apr 2024 • Qingyang Wu, Ying Xu, Tingsong Xiao, Yunze Xiao, Yitong Li, Tianyang Wang, Yichi Zhang, Shanghai Zhong, Yuwei Zhang, Wei Lu, Yifan Yang

This study conducts a comprehensive review and analysis of the existing literature on the attitudes of LLMs towards the 17 SDGs, emphasizing the comparison between their attitudes and support for each goal and those of humans.

Paper
Add Code

Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models

no code implementations • 20 Apr 2024 • Yuyan Shi, Jialu Ma, Jin Yang, Shasha Wang, Yichi Zhang

Medical image segmentation plays an important role in many image-guided clinical approaches.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.