1 code implementation • 5 Dec 2024 • Jun Zhang, Desen Meng, Ji Qi, Zhenpeng Huang, Tao Wu, LiMin Wang
In this paper, we propose to build efficient MLLMs by leveraging the Mixture-of-Depths (MoD) mechanism, where each transformer decoder layer selects essential vision tokens to process while skipping redundant ones.
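The token-selection idea is concrete enough to sketch. Below is a minimal, hypothetical illustration of how a per-layer router might keep only a fraction of vision tokens while the rest skip the layer; the function and parameter names (`router`, `capacity_ratio`) are assumptions for illustration, not the paper's implementation.

```python
import torch

def mod_select_tokens(hidden_states, router, capacity_ratio=0.5):
    """Sketch of Mixture-of-Depths-style routing: keep the top-scoring
    fraction of vision tokens for this layer, let the rest pass through."""
    # hidden_states: (batch, num_tokens, dim); router: nn.Linear(dim, 1)
    scores = router(hidden_states).squeeze(-1)          # (batch, num_tokens)
    k = max(1, int(capacity_ratio * hidden_states.size(1)))
    topk = scores.topk(k, dim=-1).indices               # tokens the layer will process
    keep_mask = torch.zeros_like(scores, dtype=torch.bool)
    keep_mask.scatter_(1, topk, True)
    return keep_mask  # attention/MLP are applied only to masked tokens
```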
no code implementations • 27 Nov 2024 • Yuze Wang, Aoran Hu, Ji Qi, Yang Liu, Chao Tao
On the one hand, to alleviate the overfitting caused by the model over-trusting the residual errors in high-quality labels, we encode the similarity/aggregation of cropland in the visual/spatial domain to construct an unsupervised learning signal, and use it as a regularization term to constrain the supervised part.
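A minimal sketch of the "supervised loss plus unsupervised regularizer" pattern described above, assuming a simple similarity-weighted consistency term between neighbouring pixels; the exact form of the regularizer here is an illustrative assumption, not the paper's definition.

```python
import torch.nn.functional as F

def cropland_loss(logits, noisy_labels, features, lam=0.1):
    """Supervised cross-entropy on (possibly noisy) labels plus a
    similarity-weighted agreement term between adjacent pixels."""
    # logits: (B, C, H, W); features: (B, D, H, W); noisy_labels: (B, H, W)
    supervised = F.cross_entropy(logits, noisy_labels)

    probs = logits.softmax(dim=1)
    # feature similarity and prediction disagreement of horizontally adjacent pixels
    sim = F.cosine_similarity(features[..., :-1], features[..., 1:], dim=1)   # (B, H, W-1)
    diff = (probs[..., :-1] - probs[..., 1:]).pow(2).sum(dim=1)               # (B, H, W-1)
    regularizer = (sim.clamp(min=0) * diff).mean()

    return supervised + lam * regularizer
```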
no code implementations • 18 Oct 2024 • Jianfa Chen, Emily Shen, Trupti Bavalatti, Xiaowen Lin, Yongkai Wang, Shuming Hu, Harihar Subramanyam, Ksheeraj Sai Vepuri, Ming Jiang, Ji Qi, Li Chen, Nan Jiang, Ankit Jain
Robust content moderation classifiers are essential for the safety of Generative AI systems.
no code implementations • 4 Sep 2024 • Xing Lan, Jian Xue, Ji Qi, Dongmei Jiang, Ke Lu, Tat-Seng Chua
Specifically, we have designed the CoT mechanism from three key perspectives: key observations, overall emotional interpretation, and conclusion.
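A hypothetical prompt skeleton mirroring the three perspectives listed above; the wording is illustrative and not taken from the paper.

```python
def build_fer_cot_prompt(face_description: str) -> str:
    """Illustrative chain-of-thought prompt for facial expression recognition."""
    return (
        "You are analysing a facial expression.\n"
        f"Image description: {face_description}\n\n"
        "1. Key observations: list the salient facial cues (brows, eyes, mouth).\n"
        "2. Overall emotional interpretation: reason about what these cues imply.\n"
        "3. Conclusion: output a single expression label."
    )
```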
Facial Expression Recognition (FER)
3 code implementations • 29 Aug 2024 • Wenyi Hong, Weihan Wang, Ming Ding, Wenmeng Yu, Qingsong Lv, Yan Wang, Yean Cheng, Shiyu Huang, Junhui Ji, Zhao Xue, Lei Zhao, Zhuoyi Yang, Xiaotao Gu, Xiaohan Zhang, Guanyu Feng, Da Yin, Zihan Wang, Ji Qi, Xixuan Song, Peng Zhang, Debing Liu, Bin Xu, Juanzi Li, Yuxiao Dong, Jie Tang
Beginning with VisualGLM and CogVLM, we are continuously exploring VLMs in pursuit of enhanced vision-language fusion, efficient higher-resolution architecture, and broader modalities and applications.
Ranked #10 on Visual Question Answering on MM-Vet
no code implementations • 23 Jul 2024 • Zeyu Wang, Weichen Dai, Xiangyu Zhou, Ji Qi, Yi Zhou
Vision Transformer and its variants have been adopted in many visual tasks due to their powerful capabilities, but they also bring significant challenges in computation and storage.
1 code implementation • 12 Jun 2024 • Weihan Wang, Zehai He, Wenyi Hong, Yean Cheng, Xiaohan Zhang, Ji Qi, Xiaotao Gu, Shiyu Huang, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang
To address this gap, we introduce LVBench, a benchmark specifically designed for long video understanding.
1 code implementation • 7 Apr 2024 • Kai Sun, Yushi Bai, Ji Qi, Lei Hou, Juanzi Li
This highlights the challenging nature of our benchmark for existing models and the significant gap between the multimodal reasoning capabilities of current models and humans.
1 code implementation • 23 Feb 2024 • Zui Chen, Yezeng Chen, Jiaqi Han, Zhijie Huang, Ji Qi, Yi Zhou
Large language models (LLMs) are displaying emergent abilities for math reasoning tasks, and there is growing attention on enhancing the ability of open-source LLMs through supervised fine-tuning (SFT). In this paper, we aim to explore a general data strategy for supervised data to help optimize and expand math reasoning ability. First, we determine the ability boundary of reasoning-path augmentation by identifying the minimal optimal set of these paths. Second, we validate that different abilities of the model can be cumulatively enhanced by mixing the minimal optimal sets of the corresponding data types, and our MMOS models achieve SOTA performance on a series of base models at much lower construction cost. Besides, we point out that GSM-HARD is not really hard and that today's LLMs no longer lack numerical robustness. We also provide an Auto Problem Generator for robustness testing and educational applications. Our code and data are publicly available at https://github.com/cyzhh/MMOS.
Ranked #2 on Math Word Problem Solving on ASDiv-A (using extra training data)
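The minimal-optimal-set idea can be sketched as a small data-pruning step. The selection rule below (keep one shortest correct path per distinct answer, then mix across data types) is an assumption for illustration, not the paper's exact procedure.

```python
def minimal_optimal_set(samples):
    """Prune augmented reasoning paths: keep at most one (shortest) correct
    path per (question, final answer) pair."""
    best = {}
    for s in samples:  # s: dict with 'question', 'answer', 'path', 'correct'
        if not s["correct"]:
            continue
        key = (s["question"], s["answer"])
        if key not in best or len(s["path"]) < len(best[key]["path"]):
            best[key] = s
    return list(best.values())

def mix_minimal_sets(sets_by_type):
    """Mix the minimal optimal sets of different data types (e.g. natural-language
    CoT vs. code-style solutions) into one SFT corpus."""
    mixed = []
    for subset in sets_by_type.values():
        mixed.extend(minimal_optimal_set(subset))
    return mixed
```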
1 code implementation • 6 Feb 2024 • Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, Jie Tang
Drawing inspiration from human cognition in solving visual problems (e.g., marking, zooming in), this paper introduces Chain of Manipulations, a mechanism that enables VLMs to solve problems step-by-step with evidence.
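An illustrative loop for this kind of step-by-step visual reasoning with evidence; `vlm.propose_step`, `vlm.answer`, and the manipulation names are hypothetical stand-ins, not the released API.

```python
def chain_of_manipulations(vlm, image, question, max_steps=5):
    """Iteratively let the model pick a manipulation (zoom, mark) or answer,
    accumulating grounded evidence along the way."""
    evidence = []
    current = image
    for _ in range(max_steps):
        step = vlm.propose_step(current, question, evidence)
        if step["action"] == "answer":
            return step["text"], evidence
        if step["action"] == "zoom_in":
            current = current.crop(step["box"])   # inspect a region more closely
        elif step["action"] == "mark":
            evidence.append(step["box"])          # record grounded evidence
    return vlm.answer(current, question, evidence), evidence
```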
1 code implementation • 31 Jan 2024 • Yushi Bai, Xin Lv, Jiajie Zhang, Yuze He, Ji Qi, Lei Hou, Jie Tang, Yuxiao Dong, Juanzi Li
Extending large language models to effectively handle long contexts requires instruction fine-tuning on input sequences of similar length.
4 code implementations • 6 Nov 2023 • Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang
We introduce CogVLM, a powerful open-source visual language foundation model.
Ranked #4 on Visual Question Answering (VQA) on InfiMM-Eval
no code implementations • 16 Oct 2023 • Ji Qi, Kaixuan Ji, Xiaozhi Wang, Jifan Yu, Kaisheng Zeng, Lei Hou, Juanzi Li, Bin Xu
Open Information Extraction (OIE) aims to extract objective structured knowledge from natural text, and building dedicated models guided by human experience has attracted growing attention.
no code implementations • 16 Oct 2023 • Ji Qi, Kaixuan Ji, Jifan Yu, Duokang Wang, Bin Xu, Lei Hou, Juanzi Li
Building models that comprehend videos and respond to specific user instructions is a practical and challenging topic, as it requires mastery of both vision understanding and knowledge reasoning.
no code implementations • 18 Aug 2023 • Pengbo Hu, Ji Qi, Xingyu Li, Hong Li, Xinqi Wang, Bing Quan, Ruiyu Wang, Yi Zhou
Our approach achieves strong performance while significantly reducing the number of inference steps.
1 code implementation • 15 Jun 2023 • Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi, Hailong Jin, Jinxin Liu, Yu Gu, Yuan YAO, Ning Ding, Lei Hou, Zhiyuan Liu, Bin Xu, Jie Tang, Juanzi Li
The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations.
1 code implementation • 23 May 2023 • Ji Qi, Chuchun Zhang, Xiaozhi Wang, Kaisheng Zeng, Jifan Yu, Jinxin Liu, Jiuding Sun, Yuxiang Chen, Lei Hou, Juanzi Li, Bin Xu
In this paper, we present the first benchmark that simulates the evaluation of open information extraction models in the real world, where the syntactic and expressive distributions of the same underlying knowledge may drift in various ways.
1 code implementation • 26 Mar 2023 • Ji Qi, Jifan Yu, Teng Tu, Kunyu Gao, Yifan Xu, Xinyu Guan, Xiaozhi Wang, Yuxiao Dong, Bin Xu, Lei Hou, Juanzi Li, Jie Tang, Weidong Guo, Hui Liu, Yu Xu
Despite the recent emergence of video captioning models, how to generate vivid, fine-grained video descriptions based on background knowledge (i.e., long and informative commentary about domain-specific scenes with appropriate reasoning) remains far from solved, even though it has valuable applications such as automatic sports narration.
1 code implementation • 17 Jan 2023 • Ji Qi, Yuxiang Chen, Lei Hou, Juanzi Li, Bin Xu
In this paper, we propose a syntactically robust training framework that enables models to be trained on a syntactically abundant distribution based on diverse paraphrase generation.
no code implementations • 30 Nov 2022 • Jie Yan, Jing Liu, Ji Qi, Zhong-Yuan Zhang
Federated clustering (FC) is an essential extension of centralized clustering designed for the federated setting, wherein the challenge lies in constructing a global similarity measure without the need to share private data.
1 code implementation • 15 Nov 2022 • Chao Tao, Ji Qi, Mingning Guo, Qing Zhu, Haifeng Li
Deep learning has achieved great success in learning features from massive remote sensing images (RSIs).
1 code implementation • 29 Oct 2022 • Jie Yan, Jing Liu, Ji Qi, Zhong-Yuan Zhang
Federated clustering (FC) is an extension of centralized clustering in federated settings.
no code implementations • 8 Oct 2022 • Ji Qi, Bin Xu, Kaisheng Zeng, Jinxin Liu, Jifan Yu, Qi Gao, Juanzi Li, Lei Hou
Document-level relation extraction with graph neural networks faces a fundamental graph-construction gap between training and inference: the gold graph structure is only available during training, which forces most methods to adopt heuristic or syntactic rules to construct a prior graph as a pseudo proxy.
no code implementations • 23 Apr 2022 • Jie Yan, Xin Liu, Ji Qi, Tao You, Zhong-Yuan Zhang
Clustering ensembles show impressive performance in improving the accuracy and robustness of partition results and have received much attention in recent years.
1 code implementation • 31 May 2021 • Dejie Chang, Mosha Chen, Chaozhen Liu, LiPing Liu, Dongdong Li, Wei Li, Fei Kong, Bangchang Liu, Xiaobin Luo, Ji Qi, Qiao Jin, Bin Xu
In order to accelerate research on domain-specific knowledge graphs in the medical domain, we introduce DiaKG, a high-quality Chinese dataset for a diabetes knowledge graph, which contains 22,050 entities and 6,890 relations in total.
no code implementations • 11 Mar 2021 • ZiYuan Gao, Sanjay Jain, Ji Qi, Philipp Schlicht, Frank Stephan, Jacob Tarr
The present work looks at semiautomatic rings with automatic addition and comparisons, which are dense subrings of the real numbers, and asks how these can be used to represent geometric objects such that certain operations and transformations are automatic.
Formal Languages and Automata Theory, Logic
no code implementations • 2 Oct 2020 • Chao Tao, Ji Qi, Weipeng Lu, Hao Wang, Haifeng Li
With the development of deep learning, supervised learning methods perform well in remote sensing image (RSI) scene classification.
1 code implementation • ISPRS Journal of Photogrammetry and Remote Sensing 2019 • Chao Tao, Ji Qi, Yansheng Li, Hao Wang, Haifeng Li
The validation experiments using three large datasets of very high-resolution (VHR) satellite imagery show that the proposed method can improve road extraction accuracy and provide an output that is more in line with human expectations.
no code implementations • EMNLP 2018 • Changliang Li, Liang Li, Ji Qi
In this work, we propose a novel self-attentive model with gate mechanism to fully utilize the semantic correlation between slot and intent.
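A minimal sketch of a gate that fuses intent context into slot features, under the assumption of a sigmoid gate over concatenated representations; the exact formulation in the paper may differ.

```python
import torch
import torch.nn as nn

class SlotIntentGate(nn.Module):
    """Gate the sentence-level intent representation into token-level slot features."""
    def __init__(self, hidden):
        super().__init__()
        self.gate = nn.Linear(2 * hidden, hidden)

    def forward(self, slot_states, intent_context):
        # slot_states: (B, T, H); intent_context: (B, H) broadcast over time
        intent = intent_context.unsqueeze(1).expand_as(slot_states)
        g = torch.sigmoid(self.gate(torch.cat([slot_states, intent], dim=-1)))
        return slot_states + g * intent   # intent information gated into slot features
```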
no code implementations • WS 2018 • Changliang Li, Ji Qi
A Chinese grammatical error diagnosis system is a very important tool that can help Chinese learners automatically diagnose grammatical errors in many scenarios.
no code implementations • 19 Jun 2017 • Jianyu Lin, Neil T. Clancy, Yang Hu, Ji Qi, Taran Tatla, Danail Stoyanov, Lena Maier-Hein, Daniel S. Elson
Intra-operative measurements of tissue shape and multi-/hyperspectral information have the potential to provide surgical guidance and decision-making support.
no code implementations • 15 Jun 2016 • Jianyu Lin, Neil T. Clancy, Xueqing Sun, Ji Qi, Mirek Janatka, Danail Stoyanov, Daniel S. Elson
In HSI mode standard endoscopic illumination is used, with the fibre probe collecting reflected light and encoding the spatial information into a linear format that can be imaged onto the slit of a spectrograph.