Search Results for author: Yichi Zhang

Found 119 papers, 70 papers with code

HonkaiChat: Companions from Anime that feel alive!

no code implementations5 Jan 2025 Yueze Liu, Yichi Zhang, Shaan Om Patel, Zhaoyang Zhu, Shilong Guo

Modern conversational agents, including anime-themed chatbots, are frequently reactive and personality-driven but fail to capture the dynamic nature of human interactions.

Chatbot

6DMA-Aided Hybrid Beamforming with Joint Antenna Position and Orientation Optimization

no code implementations22 Dec 2024 Yichi Zhang, Yuchen Zhang, Lipeng Zhu, Sa Xiao, Wanbin Tang, Yonina C. Eldar, Rui Zhang

This paper studies a sub-connected six-dimensional movable antenna (6DMA)-aided multi-user communication system.

Position

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

1 code implementation16 Dec 2024 Liang Chen, Zekun Wang, Shuhuai Ren, Lei LI, Haozhe Zhao, Yunshui Li, Zefan Cai, Hongcheng Guo, Lei Zhang, Yizhe Xiong, Yichi Zhang, Ruoyu Wu, Qingxiu Dong, Ge Zhang, Jian Yang, Lingwei Meng, Shujie Hu, Yulong Chen, Junyang Lin, Shuai Bai, Andreas Vlachos, Xu Tan, Minjia Zhang, Wen Xiao, Aaron Yee, Tianyu Liu, Baobao Chang

As Large Language Models (LLMs) have advanced to unify understanding and generation tasks within the textual modality, recent research has shown that tasks from different modalities can also be effectively encapsulated within the NTP framework, transforming the multimodal information into tokens and predict the next one given the context.

Language Modeling Language Modelling +1

PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection

1 code implementation11 Dec 2024 Sihan Chen, Zhuangzhuang Qian, Wingchun Siu, Xingcan Hu, Jiaqi Li, Shawn Li, Yuehan Qin, Tiankai Yang, Zhuo Xiao, Wanghao Ye, Yichi Zhang, Yushun Dong, Yue Zhao

Outlier detection (OD), also known as anomaly detection, is a critical machine learning (ML) task with applications in fraud detection, network intrusion detection, clickstream analysis, recommendation systems, and social network moderation.

Anomaly Detection Fraud Detection +5

Scaling Laws for Black box Adversarial Attacks

no code implementations25 Nov 2024 Chuan Liu, Huanran Chen, Yichi Zhang, Yinpeng Dong, Jun Zhu

By analyzing the relationship between the number of surrogate models and transferability of adversarial examples, we conclude with clear scaling laws, emphasizing the potential of using more surrogate models to enhance adversarial transferability.

MKGL: Mastery of a Three-Word Language

no code implementations10 Oct 2024 Lingbing Guo, Zhongpu Bo, Zhuo Chen, Yichi Zhang, Jiaoyan Chen, Yarong Lan, Mengshu Sun, Zhiqiang Zhang, Yangyifei Luo, Qian Li, Qiang Zhang, Wen Zhang, Huajun Chen

Large language models (LLMs) have significantly advanced performance across a spectrum of natural language processing (NLP) tasks.

Knowledge Graphs Sentence

Liver Cancer Knowledge Graph Construction based on dynamic entity replacement and masking strategies RoBERTa-BiLSTM-CRF model

no code implementations8 Oct 2024 Yichi Zhang, Hailing Wang, Yongbin Gao, Xiaojun Hu, Yingfang Fan, Zhijun Fang

The knowledge graph construction process consists of six steps: conceptual layer design, data preprocessing, entity identification, entity normalization, knowledge fusion, and graph visualization.

graph construction named-entity-recognition +1

MetaOOD: Automatic Selection of OOD Detection Models

no code implementations4 Oct 2024 Yuehan Qin, Yichi Zhang, Yi Nian, Xueying Ding, Yue Zhao

How can we automatically select an out-of-distribution (OOD) detection model for various underlying tasks?

Autonomous Driving Meta-Learning +1

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

1 code implementation2 Oct 2024 Liang Chen, Sinan Tan, Zefan Cai, Weichu Xie, Haozhe Zhao, Yichi Zhang, Junyang Lin, Jinze Bai, Tianyu Liu, Baobao Chang

This work tackles the information loss bottleneck of vector-quantization (VQ) autoregressive image generation by introducing a novel model architecture called the 2-Dimensional Autoregression (DnD) Transformer.

Image Generation Quantization

Robust Training of Neural Networks at Arbitrary Precision and Sparsity

no code implementations14 Sep 2024 Chengxi Ye, Grace Chu, Yanfeng Liu, Yichi Zhang, Lukasz Lew, Andrew Howard

The discontinuous operations inherent in quantization and sparsification introduce obstacles to backpropagation.

Denoising Quantization

Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey

1 code implementation23 Aug 2024 Yichi Zhang, Zhenrong Shen

The unprecedented developments in segmentation foundational models have become a dominant force in the field of computer vision, introducing a multitude of previously unexplored capabilities in a wide range of natural images and videos.

Image Segmentation Segmentation +4

Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model

no code implementations20 Aug 2024 Zijian Dong, Yilei Wu, Zijiao Chen, Yichi Zhang, Yueming Jin, Juan Helen Zhou

We introduce Scaffold Prompt Tuning (ScaPT), a novel prompt-based framework for adapting large-scale functional magnetic resonance imaging (fMRI) pre-trained models to downstream tasks, with high parameter efficiency and improved performance compared to fine-tuning and baselines for prompt tuning.

Timeliness-Fidelity Tradeoff in 3D Scene Representations

no code implementations23 Jul 2024 Xiangmin Xu, Zhen Meng, Yichi Zhang, Changyang She, Philip G. Zhao

We test our framework and the proposed approach with different well-known 3D scene representation methods.

Mixed Reality

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine

1 code implementation11 Jul 2024 Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Ziyu Guo, Shicheng Li, Yichi Zhang, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Chunyuan Li, Hongsheng Li

The mathematical capabilities of Multi-modal Large Language Models (MLLMs) remain under-explored with three areas to be improved: visual encoding of math diagrams, diagram-language alignment, and chain-of-thought (CoT) reasoning.

Contrastive Learning Language Modelling +4

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

1 code implementation29 Jun 2024 Jinsheng Huang, Liang Chen, Taian Guo, Fu Zeng, Yusheng Zhao, Bohan Wu, Ye Yuan, Haozhe Zhao, Zhihui Guo, Yichi Zhang, Jingyang Yuan, Wei Ju, Luchen Liu, Tianyu Liu, Baobao Chang, Ming Zhang

Large Multimodal Models (LMMs) exhibit impressive cross-modal understanding and reasoning abilities, often assessed through multiple-choice questions (MCQs) that include an image, a question, and several options.

Multiple-choice

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

no code implementations22 Jun 2024 Guanqun Wang, Xinyu Wei, Jiaming Liu, Ray Zhang, Yichi Zhang, Kevin Zhang, Maurice Chong, Shanghang Zhang

In recent years, multimodal large language models (MLLMs) have shown remarkable capabilities in tasks like visual question answering and common sense reasoning, while visual perception models have made significant strides in perception tasks, such as detection and segmentation.

Common Sense Reasoning Language Modelling +6

On Efficient Neural Network Architectures for Image Compression

1 code implementation14 Jun 2024 Yichi Zhang, Zhihao Duan, Fengqing Zhu

Recent advances in learning-based image compression typically come at the cost of high complexity.

Efficient Neural Network Image Compression

MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models

no code implementations11 Jun 2024 Yichi Zhang, Yao Huang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Yifan Wang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu

Despite the superior capabilities of Multimodal Large Language Models (MLLMs) across diverse tasks, they still face significant trustworthiness challenges.

Benchmarking Fairness

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

2 code implementations4 Jun 2024 Zefan Cai, Yichi Zhang, Bofei Gao, Yuliang Liu, Tianyu Liu, Keming Lu, Wayne Xiong, Yue Dong, Baobao Chang, Junjie Hu, Wen Xiao

Our experimental evaluations, utilizing the LongBench benchmark, show that PyramidKV matches the performance of models with a full KV cache while retaining only 12% of the KV cache, thus significantly reducing memory usage.

Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning

1 code implementation27 May 2024 Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Wen Zhang, Huajun Chen

Learning high-quality multi-modal entity representations is an important goal of multi-modal knowledge graph (MMKG) representation learning, which can enhance reasoning tasks within the MMKGs, such as MMKG completion (MMKGC).

Multi-modal Knowledge Graph Relation +1

Eliciting Informative Text Evaluations with Large Language Models

1 code implementation23 May 2024 Yuxuan Lu, Shengwei Xu, Yichi Zhang, Yuqing Kong, Grant Schoenebeck

We highlight the results that on the ICLR dataset, our mechanisms can differentiate three quality levels -- human-written reviews, GPT-4-generated reviews, and GPT-3. 5-generated reviews in terms of expected scores.

Multiple-choice

Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks

1 code implementation21 May 2024 Yichi Zhang, Binbin Hu, Zhuo Chen, Lingbing Guo, Ziqi Liu, Zhiqiang Zhang, Lei Liang, Huajun Chen, Wen Zhang

In response to the lack of open-source benchmarks, we constructed a new multi-domain KGP benchmark called KPI with two large-scale KGs and six different sub-domain tasks to evaluate our method and open-sourced it for subsequent research.

Knowledge Graphs

Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

no code implementations22 Apr 2024 Qingyang Wu, Ying Xu, Tingsong Xiao, Yunze Xiao, Yitong Li, Tianyang Wang, Yichi Zhang, Shanghai Zhong, Yuwei Zhang, Wei Lu, Yifan Yang

This study conducts a comprehensive review and analysis of the existing literature on the attitudes of LLMs towards the 17 SDGs, emphasizing the comparison between their attitudes and support for each goal and those of humans.

Decision Making

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

1 code implementation CVPR 2024 Yichi Zhang, Yinpeng Dong, Siyuan Zhang, Tianzan Min, Hang Su, Jun Zhu

To achieve this, we propose Transferable Visual Prompting (TVP), a simple and effective approach to generate visual prompts that can transfer to different models and improve their performance on downstream tasks after trained on only one model.

Hallucination Multimodal Reasoning +2

Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation

2 code implementations15 Apr 2024 Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Wen Zhang, Huajun Chen

To further augment the multi-modal representations, MyGO incorporates fine-grained contrastive learning to highlight the specificity of the entity representations.

Contrastive Learning Descriptive +3

Autonomous Evaluation and Refinement of Digital Agents

1 code implementation9 Apr 2024 Jiayi Pan, Yichi Zhang, Nicholas Tomlin, Yifei Zhou, Sergey Levine, Alane Suhr

We show that domain-general automatic evaluators can significantly improve the performance of agents for web navigation and device control.

Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs

1 code implementation1 Apr 2024 Xiaoze Liu, Feijie Wu, Tianyang Xu, Zhuo Chen, Yichi Zhang, Xiaoqian Wang, Jing Gao

In this paper, we propose GraphEval to evaluate an LLM's performance using a substantially large test dataset.

Knowledge Graphs

Theoretical Bound-Guided Hierarchical VAE for Neural Image Codecs

1 code implementation27 Mar 2024 Yichi Zhang, Zhihao Duan, Yuning Huang, Fengqing Zhu

Recent studies reveal a significant theoretical link between variational autoencoders (VAEs) and rate-distortion theory, notably in utilizing VAEs to estimate the theoretical upper bound of the information rate-distortion function of images.

Noise-powered Multi-modal Knowledge Graph Representation Framework

1 code implementation11 Mar 2024 Zhuo Chen, Yin Fang, Yichi Zhang, Lingbing Guo, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen, Wen Zhang

In this work, we explore the efficacy of models in accurately embedding entities within MMKGs through two pivotal tasks: Multi-modal Knowledge Graph Completion (MKGC) and Multi-modal Entity Alignment (MMEA).

Knowledge Graph Completion Misconceptions +3

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

no code implementations CVPR 2024 Yichi Zhang, Ziqiao Ma, Xiaofeng Gao, Suhaila Shakiah, Qiaozi Gao, Joyce Chai

Most multimodal large language models (MLLMs) learn language-to-object grounding through causal language modeling where grounded objects are captured by bounding boxes as sequences of location tokens.

Causal Language Modeling Generalized Referring Expression Segmentation +4

Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion

1 code implementation22 Feb 2024 Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen, Wen Zhang

To address the mentioned problems, we propose Adaptive Multi-modal Fusion and Modality Adversarial Training (AdaMF-MAT) to unleash the power of imbalanced modality information for MMKGC.

Multi-modal Knowledge Graph

Spot Check Equivalence: an Interpretable Metric for Information Elicitation Mechanisms

no code implementations21 Feb 2024 Shengwei Xu, Yichi Zhang, Paul Resnick, Grant Schoenebeck

However, different metrics lead to divergent and even contradictory results in various contexts.

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

1 code implementation21 Feb 2024 Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang

To address this, we introduce Embodied-Instruction-Evolution (EIE), an automatic framework for synthesizing instruction tuning examples in multimodal embodied environments.

Autonomous Driving Decision Making

UMOEA/D: A Multiobjective Evolutionary Algorithm for Uniform Pareto Objectives based on Decomposition

no code implementations14 Feb 2024 Xiaoyuan Zhang, Xi Lin, Yichi Zhang, Yifan Chen, Qingfu Zhang

Multiobjective optimization (MOO) is prevalent in numerous applications, in which a Pareto front (PF) is constructed to display optima under various preferences.

Diversity Multiobjective Optimization

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

6 code implementations8 Feb 2024 Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

In this survey, we carefully review over 300 articles, focusing on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, where KGs support multi-modal tasks, and Multi-Modal Knowledge Graph (MM4KG), which extends KG studies into the MMKG realm.

Entity Alignment Image Classification +5

Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

1 code implementation7 Feb 2024 Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei LI

Mamba-UNet adopts a pure Visual Mamba (VMamba)-based encoder-decoder structure, infused with skip connections to preserve spatial information across different scales of the network.

Cardiac Segmentation Computational Efficiency +6

Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science

no code implementations6 Feb 2024 Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein

Intelligent agents powered by large language models (LLMs) have demonstrated substantial promise in autonomously conducting experiments and facilitating scientific discoveries across various disciplines.

Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs

no code implementations31 Jan 2024 Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang

Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs.

Deep Learning Quantization

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

no code implementations21 Jan 2024 Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma

While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image.

Clustering Image Compression +3

Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions

1 code implementation7 Jan 2024 Yichi Zhang, Zhenrong Shen, Rushi Jiao

Due to the inherent flexibility of prompting, foundation models have emerged as the predominant force in the fields of natural language processing and computer vision.

Benchmarking Image Segmentation +4

SemiSAM: Enhancing Semi-Supervised Medical Image Segmentation via SAM-Assisted Consistency Regularization

1 code implementation11 Dec 2023 Yichi Zhang, Jin Yang, Yuchen Liu, Yuan Cheng, Yuan Qi

Semi-supervised learning has attracted much attention due to its less dependence on acquiring abundant annotations from experts compared to fully supervised methods, which is especially important for medical image segmentation which typically requires intensive pixel/voxel-wise labeling by domain experts.

Image Segmentation Segmentation +2

Diffusion Noise Feature: Accurate and Fast Generated Image Detection

1 code implementation5 Dec 2023 Yichi Zhang, Xiaogang Xu

DNF is extracted from the estimated noise generated during the inverse diffusion process.

ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code

1 code implementation16 Nov 2023 Xiangru Tang, Yuliang Liu, Zefan Cai, Yanjun Shao, Junjie Lu, Yichi Zhang, Zexuan Deng, Helan Hu, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Liang Chen, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yin Fang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein

Despite Large Language Models (LLMs) like GPT-4 achieving impressive results in function-level code generation, they struggle with repository-scale code understanding (e. g., coming up with the right arguments for calling routines), requiring a deeper comprehension of complex file interactions.

Code Generation Navigate +1

Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

1 code implementation11 Nov 2023 Yichi Zhang, Zhuo Chen, Yin Fang, Yanxi Lu, Fangming Li, Wen Zhang, Huajun Chen

Deploying large language models (LLMs) to real scenarios for domain-specific question answering (QA) is a key thrust for LLM applications, which poses numerous challenges, especially in ensuring that responses are both accommodating to user requirements and appropriately leveraging domain-specific knowledge bases.

Knowledge Graphs Question Answering

Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?

1 code implementation1 Nov 2023 Yuwei Bao, Keunwoo Peter Yu, Yichi Zhang, Shane Storks, Itamar Bar-Yossef, Alexander De La Iglesia, Megan Su, Xiao Lin Zheng, Joyce Chai

Despite tremendous advances in AI, it remains a significant challenge to develop interactive task guidance systems that can offer situated, personalized guidance and assist humans in various tasks.

Decision Making

Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?

1 code implementation31 Oct 2023 Yichi Zhang, Jiayi Pan, Yuchen Zhou, Rui Pan, Joyce Chai

Vision-Language Models (VLMs) are trained on vast amounts of data captured by humans emulating our understanding of the world.

Competitive Ensembling Teacher-Student Framework for Semi-Supervised Left Atrium MRI Segmentation

no code implementations21 Oct 2023 Yuyan Shi, Yichi Zhang, Shasha Wang

Semi-supervised learning has greatly advanced medical image segmentation since it effectively alleviates the need of acquiring abundant annotations from experts and utilizes unlabeled data which is much easier to acquire.

Image Segmentation Left Atrium Segmentation +4

Making Large Language Models Perform Better in Knowledge Graph Completion

1 code implementation10 Oct 2023 Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Wen Zhang, Huajun Chen

In this paper, we explore methods to incorporate structural information into the LLMs, with the overarching goal of facilitating structure-aware reasoning.

In-Context Learning Language Modeling +2

How Robust is Google's Bard to Adversarial Image Attacks?

1 code implementation21 Sep 2023 Yinpeng Dong, Huanran Chen, Jiawei Chen, Zhengwei Fang, Xiao Yang, Yichi Zhang, Yu Tian, Hang Su, Jun Zhu

By attacking white-box surrogate vision encoders or MLLMs, the generated adversarial examples can mislead Bard to output wrong image descriptions with a 22% success rate based solely on the transferability.

Adversarial Robustness Chatbot +1

Dynamic Time Warping for Lead-Lag Relationships in Lagged Multi-Factor Models

no code implementations15 Sep 2023 Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

In multivariate time series systems, lead-lag relationships reveal dependencies between time series when they are shifted in time relative to each other.

Dynamic Time Warping Time Series

MACO: A Modality Adversarial and Contrastive Framework for Modality-missing Multi-modal Knowledge Graph Completion

1 code implementation13 Aug 2023 Yichi Zhang, Zhuo Chen, Wen Zhang

Nevertheless, existing methods emphasize the design of elegant KGC models to facilitate modality interaction, neglecting the real-life problem of missing modalities in KGs.

Multi-modal Knowledge Graph

Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

1 code implementation30 Jul 2023 Zhuo Chen, Lingbing Guo, Yin Fang, Yichi Zhang, Jiaoyan Chen, Jeff Z. Pan, Yangning Li, Huajun Chen, Wen Zhang

As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information.

 Ranked #1 on Multi-modal Entity Alignment on UMVM-oea-d-w-v2 (using extra training data)

Benchmarking Knowledge Graph Embeddings +2

CausE: Towards Causal Knowledge Graph Embedding

1 code implementation21 Jul 2023 Yichi Zhang, Wen Zhang

Knowledge graph embedding (KGE) focuses on representing the entities and relations of a knowledge graph (KG) into the continuous vector spaces, which can be employed to predict the missing triples to achieve knowledge graph completion (KGC).

Disentanglement Knowledge Graph Completion +1

PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs

2 code implementations15 Jun 2023 Zhongkai Hao, Jiachen Yao, Chang Su, Hang Su, Ziao Wang, Fanzhi Lu, Zeyu Xia, Yichi Zhang, Songming Liu, Lu Lu, Jun Zhu

In addition to providing a standardized means of assessing performance, PINNacle also offers an in-depth analysis to guide future research, particularly in areas such as domain decomposition methods and loss reweighting for handling multi-scale problems and complex geometry.

Benchmarking

Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models

no code implementations11 May 2023 Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

In multivariate time series systems, key insights can be obtained by discovering lead-lag relationships inherent in the data, which refer to the dependence between two time series shifted in time relative to one another, and which can be leveraged for the purposes of control, forecasting or clustering.

Clustering Time Series

Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey

no code implementations5 May 2023 Yichi Zhang, Rushi Jiao

Due to the flexibility of prompting, foundation models have become the dominant force in the domains of natural language processing and image generation.

Benchmarking Image Generation +5

Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding

1 code implementation23 Apr 2023 Yichi Zhang, Mingyang Chen, Wen Zhang

Negative sampling (NS) is widely used in knowledge graph embedding (KGE), which aims to generate negative triples to make a positive-negative contrast during training.

Knowledge Graph Embedding Multi-modal Knowledge Graph

Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning

no code implementations20 Apr 2023 Chenglu Sun, Yichi Zhang, Yu Zhang, Ziling Lu, Jingbin Liu, Sijia Xu, Weidong Zhang

We propose asymmetric-evolution training (AET), a novel multi-agent reinforcement learning framework that can train multiple kinds of agents simultaneously in AMP game.

Multi-agent Reinforcement Learning reinforcement-learning

Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

1 code implementation CVPR 2023 Zijian Zhu, Yichi Zhang, Hai Chen, Yinpeng Dong, Shu Zhao, Wenbo Ding, Jiachen Zhong, Shibao Zheng

However, there still lacks a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems.

3D Object Detection Adversarial Robustness +2

To Make Yourself Invisible with Adversarial Semantic Contours

no code implementations1 Mar 2023 Yichi Zhang, Zijian Zhu, Hang Su, Jun Zhu, Shibao Zheng, Yuan He, Hui Xue

In this paper, we propose Adversarial Semantic Contour (ASC), an MAP estimate of a Bayesian formulation of sparse attack with a deceived prior of object contour.

Autonomous Driving Object +2

DA-VEGAN: Differentiably Augmenting VAE-GAN for microstructure reconstruction from extremely small data sets

no code implementations17 Feb 2023 Yichi Zhang, Paul Seibert, Alexandra Otto, Alexander Raßloff, Marreddy Ambati, Markus Kästner

Microstructure reconstruction is an important and emerging field of research and an essential foundation to improving inverse computational materials engineering (ICME).

Data Augmentation

Binarized Neural Machine Translation

1 code implementation NeurIPS 2023 Yichi Zhang, Ankush Garg, Yuan Cao, Łukasz Lew, Behrooz Ghorbani, Zhiru Zhang, Orhan Firat

In this work, we propose a novel binarization technique for Transformers applied to machine translation (BMT), the first of its kind.

Binarization Machine Translation +2

MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid

1 code implementation29 Dec 2022 Zhuo Chen, Jiaoyan Chen, Wen Zhang, Lingbing Guo, Yin Fang, Yufeng Huang, Yichi Zhang, Yuxia Geng, Jeff Z. Pan, Wenting Song, Huajun Chen

Multi-modal entity alignment (MMEA) aims to discover identical entities across different knowledge graphs (KGs) whose entities are associated with relevant images.

 Ranked #1 on Entity Alignment on FBYG15k (using extra training data)

Knowledge Graphs Multi-modal Entity Alignment

Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications

1 code implementation15 Nov 2022 Zhongkai Hao, Songming Liu, Yichi Zhang, Chengyang Ying, Yao Feng, Hang Su, Jun Zhu

Recent work shows that it provides potential benefits for machine learning models by incorporating the physical prior and collected data, which makes the intersection of machine learning and physics become a prevailing paradigm.

Physics-informed machine learning

Tele-Knowledge Pre-training for Fault Analysis

1 code implementation20 Oct 2022 Zhuo Chen, Wen Zhang, Yufeng Huang, Mingyang Chen, Yuxia Geng, Hongtao Yu, Zhen Bi, Yichi Zhang, Zhen Yao, Wenting Song, Xinliang Wu, Yi Yang, Mingyi Chen, Zhaoyang Lian, YingYing Li, Lei Cheng, Huajun Chen

In this work, we share our experience on tele-knowledge pre-training for fault analysis, a crucial task in telecommunication applications that requires a wide range of knowledge normally found in both machine log data and product documents.

Language Modeling Language Modelling

Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling

no code implementations15 Sep 2022 Yichi Zhang, Wen Zhang

Twins negative sampling is suitable for multimodal scenarios and could align different embeddings for entities.

Link Prediction World Knowledge

Perturbation Analysis of Randomized SVD and its Applications to High-dimensional Statistics

no code implementations19 Mar 2022 Yichi Zhang, Minh Tang

We first derive upper bounds for the $\ell_2$ (spectral norm) and $\ell_{2\to\infty}$ (maximum row-wise $\ell_2$ norm) distances between the approximate singular vectors of $\hat{\mathbf{M}}$ and the true singular vectors of the signal matrix $\mathbf{M}$.

Community Detection Matrix Completion

Understanding Hyperdimensional Computing for Parallel Single-Pass Learning

1 code implementation10 Feb 2022 Tao Yu, Yichi Zhang, Zhiru Zhang, Christopher De Sa

Using representation theory, we characterize which similarity matrices can be "expressed" by finite group VSA hypervectors, and we show how these VSAs can be constructed.

Uncertainty-Guided Mutual Consistency Learning for Semi-Supervised Medical Image Segmentation

no code implementations5 Dec 2021 Yichi Zhang, Rushi Jiao, Qingcheng Liao, Dongyang Li, Jicong Zhang

In this paper, we propose a novel uncertainty-guided mutual consistency learning framework to effectively exploit unlabeled data by integrating intra-task consistency learning from up-to-date predictions for self-ensembling and cross-task consistency learning from task-level regularization to exploit geometric shape information.

Brain Tumor Segmentation Image Segmentation +4

PokeBNN: A Binary Pursuit of Lightweight Accuracy

1 code implementation CVPR 2022 Yichi Zhang, Zhiru Zhang, Lukasz Lew

In order to enable joint optimization of the cost together with accuracy, we define arithmetic computation effort (ACE), a hardware- and energy-inspired cost metric for quantized and binarized networks.

Binarization

Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework

1 code implementation28 Oct 2021 Lifan Yuan, Yichi Zhang, Yangyi Chen, Wei Wei

In this paper, we instantiate our framework with an attack algorithm named Textual Projected Gradient Descent (T-PGD).

Adversarial Attack Language Modelling

Adversarial Semantic Contour for Object Detection

no code implementations ICML Workshop AML 2021 Yichi Zhang, Zijian Zhu, Xiao Yang, Jun Zhu

To address this issue, we propose a novel method of Adversarial Semantic Contour (ASC) guided by object contour as prior.

Object object-detection +1

Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding

1 code implementation Findings (EMNLP) 2021 Shane Storks, Qiaozi Gao, Yichi Zhang, Joyce Chai

However, evaluations only based on end task performance shed little light on machines' true ability in language understanding and reasoning.

valid

U-Net-and-a-half: Convolutional network for biomedical image segmentation using multiple expert-driven annotations

1 code implementation10 Aug 2021 Yichi Zhang, Jesper Kers, Clarissa A. Cassol, Joris J. Roelofs, Najia Idrees, Alik Farber, Samir Haroon, Kevin P. Daly, Suvranu Ganguli, Vipul C. Chitalia, Vijaya B. Kolachalama

If more than a single expert is involved in the annotation of the same images, then the inter-expert agreement is not necessarily perfect, and no single expert annotation can precisely capture the so-called ground truth of the regions of interest on all images.

Image Segmentation Semantic Segmentation +1

Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining

1 code implementation ICCV 2021 Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang

In this paper, we investigate a more realistic setting that aims to perform weakly-supervised multi-modal instance-level product retrieval among fine-grained product categories.

Retrieval

Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making

1 code implementation ACL 2021 Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li, Yichi Zhang, Zelin Dai

Using a set of comparison features and a limited amount of annotated data, KAT Induction learns an efficient decision tree that can be interpreted by generating entity matching rules whose structure is advocated by domain experts.

Attribute Decision Making +3

Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring

1 code implementation Findings (ACL) 2021 Yichi Zhang, Joyce Chai

On the ALFRED benchmark for task learning, the published state-of-the-art system only achieves a task success rate of less than 10% in an unseen environment, compared to the human performance of over 90%.

Improving Conversational Recommendation System by Pretraining on Billions Scale of Knowledge Graph

no code implementations30 Apr 2021 Chi-Man Wong, Fan Feng, Wen Zhang, Chi-Man Vong, Hui Chen, Yichi Zhang, Peng He, Huan Chen, Kun Zhao, Huajun Chen

We first construct a billion-scale conversation knowledge graph (CKG) from information about users, items and conversations, and then pretrain CKG by introducing knowledge graph embedding method and graph convolution network to encode semantic and structural information respectively. To make the CTR prediction model sensible of current state of users and the relationship between dialogues and items, we introduce user-state and dialogue-interaction representations based on pre-trained CKG and propose K-DCN. In K-DCN, we fuse the user-state representation, dialogue-interaction representation and other normal feature representations via deep cross network, which will give the rank of candidate items to be recommended. We experimentally prove that our proposal significantly outperforms baselines and show it's real application in Alime.

Click-Through Rate Prediction Conversational Recommendation +2

Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability

1 code implementation EMNLP 2021 Xin Lv, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Yichi Zhang, Zelin Dai

However, we find in experiments that many paths given by these models are actually unreasonable, while little works have been done on interpretability evaluation for them.

Benchmarking Link Prediction

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

1 code implementation19 Mar 2021 Zhe Xie, Chengxuan Liu, Yichi Zhang, Hongtao Lu, Dong Wang, Yue Ding

To solve the above problem, in this work, we propose a novel method called Adversarial and Contrastive Variational Autoencoder (ACVAE) for sequential recommendation.

Collaborative Filtering Sequential Recommendation

Dual-Task Mutual Learning for Semi-Supervised Medical Image Segmentation

1 code implementation8 Mar 2021 Yichi Zhang, Jicong Zhang

The success of deep learning methods in medical image segmentation tasks usually requires a large amount of labeled data.

Image Segmentation Segmentation +2

Exact Recovery of Community Structures Using DeepWalk and Node2vec

no code implementations18 Jan 2021 Yichi Zhang, Minh Tang

Random-walk based network embedding algorithms like DeepWalk and node2vec are widely used to obtain Euclidean representation of the nodes in a network prior to performing downstream inference tasks.

Clustering Community Detection +1

Exploiting Shared Knowledge from Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation

no code implementations31 Dec 2020 Yichi Zhang, Qingcheng Liao, Lin Yuan, He Zhu, Jiezhen Xing, Jicong Zhang

In this paper, we propose a novel relation-driven collaborative learning model to exploit shared knowledge from non-COVID lesions for annotation-efficient COVID-19 CT lung infection segmentation.

Computed Tomography (CT) Lesion Segmentation +1

Cascaded Convolutional Neural Network for Automatic Myocardial Infarction Segmentation from Delayed-Enhancement Cardiac MRI

no code implementations28 Dec 2020 Yichi Zhang

Automatic segmentation of myocardial contours and relevant areas like infraction and no-reflow is an important step for the quantitative evaluation of myocardial infarction.

Segmentation

AbdomenCT-1K: Is Abdominal Organ Segmentation A Solved Problem?

1 code implementation28 Oct 2020 Jun Ma, Yao Zhang, Song Gu, Cheng Zhu, Cheng Ge, Yichi Zhang, Xingle An, Congcong Wang, Qiyuan Wang, Xin Liu, Shucheng Cao, Qi Zhang, Shangqing Liu, Yunpeng Wang, Yuhui Li, Jian He, Xiaoping Yang

With the unprecedented developments in deep learning, automatic segmentation of main abdominal organs seems to be a solved problem as state-of-the-art (SOTA) methods have achieved comparable results with inter-rater variability on many benchmark datasets.

Continual Learning Organ Segmentation +2

Distributed Representations of Entities in Open-World Knowledge Graphs

no code implementations16 Oct 2020 Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Yichi Zhang, Zequn Sun, Zhongpo Bo, Yin Fang, Xiaoze Liu, Huajun Chen, Wen Zhang

DAN leverages neighbor context as the query vector to score the neighbors of an entity, thereby distributing the entity semantics only among its neighbor embeddings.

Entity Alignment Graph Neural Network +3

Bridging 2D and 3D Segmentation Networks for Computation Efficient Volumetric Medical Image Segmentation: An Empirical Study of 2.5D Solutions

no code implementations13 Oct 2020 Yichi Zhang, Qingcheng Liao, Le Ding, Jicong Zhang

Despite these works lead to improvements on a variety of segmentation tasks, to the best of our knowledge, there has not previously been a large-scale empirical comparison of these methods.

Image Segmentation Segmentation +2

Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph

1 code implementation EMNLP 2020 Xin Lv, Xu Han, Lei Hou, Juanzi Li, Zhiyuan Liu, Wei zhang, Yichi Zhang, Hao Kong, Suhui Wu

On the one hand, sparse KGs contain less information, which makes it difficult for the model to choose correct paths.

A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

1 code implementation EMNLP 2020 Yichi Zhang, Zhijian Ou, Huixin Wang, Junlan Feng

In this paper we aim at alleviating the reliance on belief state labels in building end-to-end dialog systems, by leveraging unlabeled dialog data towards semi-supervised learning.

End-To-End Dialogue Modelling

Paraphrase Augmented Task-Oriented Dialog Generation

1 code implementation ACL 2020 Silin Gao, Yichi Zhang, Zhijian Ou, Zhou Yu

Neural generative models have achieved promising performance on dialog generation tasks if given a huge data set.

Data Augmentation Response Generation

Covariance Estimation for Matrix-valued Data

no code implementations11 Apr 2020 Yichi Zhang, Weining Shen, Dehan Kong

Covariance estimation for matrix-valued data has received an increasing interest in applications.

Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations

1 code implementation ICLR 2020 Yichi Zhang, Ritchie Zhao, Weizhe Hua, Nayun Xu, G. Edward Suh, Zhiru Zhang

The proposed approach is applicable to a variety of DNN architectures and significantly reduces the computational cost of DNN execution with almost no accuracy loss.

Quantization

SAU-Net: Efficient 3D Spine MRI Segmentation Using Inter-Slice Attention

no code implementations MIDL 2019 Yichi Zhang, Lin Yuan, Yujia Wang, Jicong Zhang

Accurate segmentation of spine Magnetic Resonance Imaging (MRI) is highly demanded in morphological research, quantitative analysis, and diseases identification, such as spinal canal stenosis, disc herniation and degeneration.

MRI segmentation Segmentation

Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context

6 code implementations24 Nov 2019 Yichi Zhang, Zhijian Ou, Zhou Yu

Conversations have an intrinsic one-to-many property, which means that multiple responses can be appropriate for the same dialog context.

Data Augmentation Diversity +2

Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection

1 code implementation25 Oct 2019 Shumin Deng, Ningyu Zhang, Jiaojian Kang, Yichi Zhang, Wei zhang, Huajun Chen

Differing from vanilla prototypical networks simply computing event prototypes by averaging, which only consume event mentions once, our model is more robust and is capable of distilling contextual information from event mentions for multiple times due to the multi-hop mechanism of DMNs.

Event Detection Event Extraction +2

Bayesian Optimization for Materials Design with Mixed Quantitative and Qualitative Variables

no code implementations3 Oct 2019 Yichi Zhang, Daniel Apley, Wei Chen

We present in this paper the integration of a novel latent-variable (LV) approach for mixed-variable GP modeling with the BO framework for materials design.

Bayesian Optimization

Data-Centric Mixed-Variable Bayesian Optimization For Materials Design

no code implementations4 Jul 2019 Akshay Iyer, Yichi Zhang, Aditya Prasad, Siyu Tao, Yixing Wang, Linda Schadler, L Catherine Brinson, Wei Chen

To this end, we present a data-centric, mixed-variable Bayesian Optimization framework that integrates data from literature, experiments, and simulations for knowledge discovery and computational materials design.

Bayesian Optimization Navigate

Elastic CRFs for Open-ontology Slot Filling

no code implementations4 Nov 2018 Yinpei Dai, Yichi Zhang, Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

An ontology is defined by the collection of slots and the values that each slot can take.

slot-filling Slot Filling

Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning

no code implementations ICLR 2018 Yichi Zhang, Zhijian Ou

An ensemble of neural networks is known to be more robust and accurate than an individual network, however usually with linearly-increased cost in both training and testing.

Language Modeling Language Modelling +1

Face Alignment Assisted by Head Pose Estimation

1 code implementation11 Jul 2015 Heng Yang, Wenxuan Mou, Yichi Zhang, Ioannis Patras, Hatice Gunes, Peter Robinson

In this paper we propose a supervised initialization scheme for cascaded face alignment based on explicit head pose estimation.

Face Alignment Head Pose Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.