Search Results for author: Tong Zheng

Found 18 papers, 7 papers with code

Learning to Reason via Mixture-of-Thought for Logical Reasoning

1 code implementation21 May 2025 Tong Zheng, Lichang Chen, Simeng Han, R. Thomas McCoy, Heng Huang

To fill in this gap, we propose Mixture-of-Thought (MoT), a framework that enables LLMs to reason across three complementary modalities: natural language, code, and a newly introduced symbolic modality, truth-table, which systematically enumerates logical cases and partially mitigates key failure modes in natural language reasoning.

Logical Reasoning Natural Language Inference

Towards Optimal Multi-draft Speculative Decoding

no code implementations26 Feb 2025 Zhengmian Hu, Tong Zheng, Vignesh Viswanathan, Ziyi Chen, Ryan A. Rossi, Yihan Wu, Dinesh Manocha, Heng Huang

For a fixed draft sampling method, the optimal acceptance rate is a solution to an optimal transport problem, but the complexity of this problem makes it difficult to solve for the optimal acceptance rate and measure the gap between existing verification algorithms and the theoretical upper bound.

Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation

no code implementations16 Feb 2025 Tong Zheng, Yan Wen, Huiwen Bao, Junfeng Guo, Heng Huang

The emergence of Large Language Models (LLMs) has advanced the multilingual machine translation (MMT), yet the Curse of Multilinguality (CoM) remains a major challenge.

Machine Translation

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

no code implementations5 Nov 2024 Bei Li, Tong Zheng, Rui Wang, Jiahao Liu, Qingyan Guo, Junliang Guo, Xu Tan, Tong Xiao, Jingbo Zhu, Jingang Wang, Xunliang Cai

First, we introduce a predictor-corrector learning framework to minimize truncation errors, which consists of a high-order predictor and a multistep corrector.

Abstractive Text Summarization Language Modeling +4

Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network

no code implementations11 Oct 2024 Qingchuan Ma, Shiao Wang, Tong Zheng, Xiaodong Dai, Yifeng Wang, Qingquan Yang, Xiao Wang

This study addresses the critical challenge of predicting the Q-distribution in long-term stable nuclear fusion task, a key component for advancing clean energy solutions.

Prediction

TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification

no code implementations4 Mar 2024 Tong Zheng, Shusaku Sone, Yoshitaka Ushiku, Yuki Oba, Jiaxin Ma

This paper presents a Tri-branch Neural Fusion (TNF) approach designed for classifying multimodal medical images and tabular data.

Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs

1 code implementation26 Oct 2023 Yuxin Zuo, Bei Li, Chuanhao Lv, Tong Zheng, Tong Xiao, Jingbo Zhu

This paper presents an in-depth study of multimodal machine translation (MMT), examining the prevailing understanding that MMT systems exhibit decreased sensitivity to visual information when text inputs are complete.

Attribute Multimodal Machine Translation +2

PartialFormer: Modeling Part Instead of Whole for Machine Translation

1 code implementation23 Oct 2023 Tong Zheng, Bei Li, Huiwen Bao, Jiale Wang, Weiqiao Shan, Tong Xiao, Jingbo Zhu

In this work, we emphasize the importance of hidden dimensions in designing lightweight FFNs, a factor often overlooked in previous architectures.

Abstractive Text Summarization Machine Translation +1

EIT: Enhanced Interactive Transformer

2 code implementations20 Dec 2022 Tong Zheng, Bei Li, Huiwen Bao, Tong Xiao, Jingbo Zhu

Two principles: the complementary principle and the consensus principle are widely acknowledged in the literature of multi-view learning.

Abstractive Text Summarization Language Modeling +4

Learning Multiscale Transformer Models for Sequence Generation

1 code implementation19 Jun 2022 Bei Li, Tong Zheng, Yi Jing, Chengbo Jiao, Tong Xiao, Jingbo Zhu

In this work, we define those scales in different linguistic units, including sub-words, words and phrases.

Multi-modality super-resolution loss for GAN-based super-resolution of clinical CT images using micro CT image database

no code implementations30 Dec 2019 Tong Zheng, Hirohisa ODA, Takayasu MORIYA, Shota NAKAMURA, Masahiro Oda, Masaki MORI, Horitsugu Takabatake, Hiroshi NATORI, Kensaku MORI

This paper newly introduces multi-modality loss function for GAN-based super-resolution that can maintain image structure and intensity on unpaired training dataset of clinical CT and micro CT volumes.

Computed Tomography (CT) Super-Resolution +1

DeepIlluminance: Contextual Illuminance Estimation via Deep Neural Networks

1 code implementation12 May 2019 Jun Zhang, Tong Zheng, Shengping Zhang, Meng Wang

First, the contextual net with a center-surround architecture extracts local contextual features from image patches, and generates initial illuminant estimates and the corresponding color corrected patches.

Color Constancy

Cannot find the paper you are looking for? You can Submit a new open access paper.