Search Results for author: Jiaxin Zhang

Found 49 papers, 21 papers with code

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

3 code implementations • ICCV 2023 • Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin

To this end, we introduce a new model named Explicit Synergy-based Text Spotting Transformer framework (ESTextSpotter), which achieves explicit synergy by modeling discriminative and interactive features for text detection and recognition within a single decoder.

Text Detection Text Spotting

253

Paper
Code

RoMe: Towards Large Scale Road Surface Reconstruction via Mesh Representation

1 code implementation • 20 Jun 2023 • Ruohong Mei, Wei Sui, Jiaxin Zhang, Xue Qin, Gang Wang, Tao Peng, Cong Yang

This paper introduces RoMe, a novel framework designed for the robust reconstruction of large-scale road surfaces.

Autonomous Driving Computational Efficiency +1

169

Paper
Code

SPTS: Single-Point Text Spotting

1 code implementation • 15 Dec 2021 • Dezhi Peng, Xinyu Wang, Yuliang Liu, Jiaxin Zhang, Mingxin Huang, Songxuan Lai, Shenggao Zhu, Jing Li, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin

For the first time, we demonstrate that training scene text spotting models can be achieved with an extremely low-cost annotation of a single-point for each instance.

Ranked #3 on Text Spotting on SCUT-CTW1500

Language Modelling Text Detection +1

126

Paper
Code

SPTS v2: Single-Point Scene Text Spotting

3 code implementations • 4 Jan 2023 • Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin

Within the context of our SPTS v2 framework, our experiments suggest a potential preference for single-point representation in scene text spotting when compared to other representations.

Ranked #15 on Text Spotting on ICDAR 2015

Text Detection Text Spotting

126

Paper
Code

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

1 code implementation • 24 Jan 2021 • Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education.

3D Feature Matching document understanding +2

Paper
Code

A Vision-Centric Approach for Static Map Element Annotation

1 code implementation • 21 Sep 2023 • Jiaxin Zhang, Shiyuan Chen, Haoran Yin, Ruohong Mei, Xuan Liu, Cong Yang, Qian Zhang, Wei Sui

The recent development of online static map element (a. k. a.

Paper
Code

M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis

1 code implementation • CVPR 2023 • Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin

Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion.

Document Layout Analysis document understanding +3

Paper
Code

Towards Accurate Ground Plane Normal Estimation from Ego-Motion

1 code implementation • 8 Dec 2022 • Jiaxin Zhang, Wei Sui, Qian Zhang, Tao Chen, Cong Yang

In this paper, we introduce a novel approach for ground plane normal estimation of wheeled vehicles.

3D Object Detection Autonomous Driving +3

Paper
Code

Inverse design of two-dimensional materials with invertible neural networks

1 code implementation • 6 Jun 2021 • Victor Fung, Jiaxin Zhang, Guoxiang Hu, P. Ganesh, Bobby G. Sumpter

The ability to readily design novel materials with chosen functional properties on-demand represents a next frontier in materials discovery.

Band Gap Vocal Bursts Valence Prediction

Paper
Code

Accelerating Inverse Learning via Intelligent Localization with Exploratory Sampling

1 code implementation • 2 Dec 2022 • Jiaxin Zhang, Sirui Bi, Victor Fung

In the scope of "AI for Science", solving inverse problems is a longstanding challenge in materials and drug discovery, where the goal is to determine the hidden structures given a set of desirable properties.

Drug Discovery

Paper
Code

Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage

1 code implementation • CVPR 2022 • Zhuohang Li, Jiaxin Zhang, Luyang Liu, Jian Liu

Federated Learning (FL) framework brings privacy benefits to distributed learning systems by allowing multiple clients to participate in a learning task under the coordination of a central server without exchanging their private data.

Bayesian Optimization Federated Learning

Paper
Code

Can Programming Languages Boost Each Other via Instruction Tuning?

1 code implementation • 31 Aug 2023 • Daoguang Zan, Ailun Yu, Bo Shen, Jiaxin Zhang, Taihong Chen, Bing Geng, Bei Chen, Jichuan Ji, Yafen Yao, Yongji Wang, Qianxiang Wang

Results demonstrate that programming languages can significantly improve each other.

Paper
Code

SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency

1 code implementation • 3 Nov 2023 • Jiaxin Zhang, Zhuohang Li, Kamalika Das, Bradley A. Malin, Sricharan Kumar

Hallucination detection is a critical step toward understanding the trustworthiness of modern language models (LMs).

Hallucination Question Answering

Paper
Code

ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler

1 code implementation • 18 Oct 2022 • Jiaxin Zhang, Yashar Moshfeghi

Numerical reasoning over text is a challenging task of Artificial Intelligence (AI), requiring reading comprehension and numerical reasoning abilities.

Ranked #1 on Math Word Problem Solving on MathQA

Math Word Problem Solving Question Answering +1

Paper
Code

VRSO: Visual-Centric Reconstruction for Static Object Annotation

1 code implementation • 22 Mar 2024 • Chenyao Yu, Yingfeng Cai, Jiaxin Zhang, Hui Kong, Wei Sui, Cong Yang

As a part of the perception results of intelligent driving systems, static object detection (SOD) in 3D space provides crucial cues for driving environment understanding.

Object object-detection +1

Paper
Code

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models

1 code implementation • 4 Jan 2024 • Wendi Cui, Jiaxin Zhang, Zhuohang Li, Lopez Damien, Kamalika Das, Bradley Malin, Sricharan Kumar

Evaluating the quality and variability of text generated by Large Language Models (LLMs) poses a significant, yet unresolved research challenge.

Hallucination Sentence

Paper
Code

Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild

1 code implementation • 23 Jul 2022 • Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding

To address this issue, we propose a novel approach called Marior (Margin Removal and \Iterative Content Rectification).

Optical Character Recognition (OCR)

Paper
Code

Atomic structure generation from reconstructing structural fingerprints

1 code implementation • 27 Jul 2022 • Victor Fung, Shuyi Jia, Jiaxin Zhang, Sirui Bi, Junqi Yin, P. Ganesh

These methods would help identify or, in the case of generative models, even create novel crystal structures of materials with a set of specified functional properties to then be synthesized or isolated in the laboratory.

BIG-bench Machine Learning valid

Paper
Code

A Novel Evolution Strategy with Directional Gaussian Smoothing for Blackbox Optimization

1 code implementation • 7 Feb 2020 • Jiaxin Zhang, Hoang Tran, Dan Lu, Guannan Zhang

Standard ES methods with $d$-dimensional Gaussian smoothing suffer from the curse of dimensionality due to the high variance of Monte Carlo (MC) based gradient estimators.

Paper
Code

Normality Learning-based Graph Anomaly Detection via Multi-Scale Contrastive Learning

1 code implementation • 12 Sep 2023 • Jingcan Duan, Pei Zhang, Siwei Wang, Jingtao Hu, Hu Jin, Jiaxin Zhang, Haifang Zhou, Xinwang Liu

Finally, the model is refined with the only input of reliable normal nodes and learns a more accurate estimate of normality so that anomalous nodes can be more easily distinguished.

Contrastive Learning Graph Anomaly Detection

Paper
Code

GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving

1 code implementation • 15 Feb 2024 • Jiaxin Zhang, Zhongzhi Li, Mingliang Zhang, Fei Yin, ChengLin Liu, Yashar Moshfeghi

Yet, their proficiency in tackling geometry math problems, which necessitates an integrated understanding of both textual and visual information, has not been thoroughly evaluated.

Geometry Problem Solving Math

Paper
Code

PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence

no code implementations • 26 Feb 2018 • Jinglan Liu, Jiaxin Zhang, Yukun Ding, Xiaowei Xu, Meng Jiang, Yiyu Shi

This work explores the binarization of the deconvolution-based generator in a GAN for memory saving and speedup of image construction.

Binarization

Paper
Add Code

Power-Law Graph Cuts

no code implementations • 29 Oct 2014 • Xiangyang Zhou, Jiaxin Zhang, Brian Kulis

Despite strong performance for a number of clustering tasks, spectral graph cut algorithms still suffer from several limitations: first, they require the number of clusters to be known in advance, but this information is often unknown a priori; second, they tend to produce clusters with uniform sizes.

Clustering Image Segmentation +1

Paper
Add Code

Robust data-driven approach for predicting the configurational energy of high entropy alloys

no code implementations • 10 Aug 2019 • Jiaxin Zhang, Xianglin Liu, Sirui Bi, Junqi Yin, Guannan Zhang, Markus Eisenbach

In this study, a robust data-driven framework based on Bayesian approaches is proposed and demonstrated on the accurate and efficient prediction of configurational energy of high entropy alloys.

feature selection Small Data Image Classification

Paper
Add Code

Learning nonlinear level sets for dimensionality reduction in function approximation

no code implementations • NeurIPS 2019 • Guannan Zhang, Jiaxin Zhang, Jacob Hinkle

We developed a Nonlinear Level-set Learning (NLL) method for dimensionality reduction in high-dimensional function approximation with small data.

Functional Analysis

Paper
Add Code

Scalable Deep-Learning-Accelerated Topology Optimization for Additively Manufactured Materials

no code implementations • 28 Nov 2020 • Sirui Bi, Jiaxin Zhang, Guannan Zhang

Unlike the existing studies of DL for TO, our framework accelerates TO by learning the iterative history data and simultaneously training on the mapping between the given design and its gradient.

Paper
Add Code

Thermodynamic Consistent Neural Networks for Learning Material Interfacial Mechanics

no code implementations • 28 Nov 2020 • Jiaxin Zhang, Congjie Wei, Chenglin Wu

In this paper, we propose a thermodynamic consistent neural network (TCNN) approach to build a data-driven model of the TSR with sparse experimental data.

Paper
Add Code

A Scalable Gradient-Free Method for Bayesian Experimental Design with Implicit Models

no code implementations • 14 Mar 2021 • Jiaxin Zhang, Sirui Bi, Guannan Zhang

However, the approach requires a sampling path to compute the pathwise gradient of the MI lower bound with respect to the design variables, and such a pathwise gradient is usually inaccessible for implicit models.

Experimental Design

Paper
Add Code

A Hybrid Gradient Method to Designing Bayesian Experiments for Implicit Models

no code implementations • 14 Mar 2021 • Jiaxin Zhang, Sirui Bi, Guannan Zhang

However, the approach in Kleinegesse et al., 2020 requires a pathwise sampling path to compute the gradient of the MI lower bound with respect to the design variables, and such a pathwise sampling path is usually inaccessible for implicit models.

Experimental Design

Paper
Add Code

Deep Online Correction for Monocular Visual Odometry

no code implementations • 18 Mar 2021 • Jiaxin Zhang, Wei Sui, Xinggang Wang, Wenming Meng, Hongmei Zhu, Qian Zhang

Second, the poses predicted by CNNs are further improved by minimizing photometric errors via gradient updates of poses during inference phases.

Monocular Visual Odometry RTE

Paper
Add Code

Byzantine-robust Federated Learning through Spatial-temporal Analysis of Local Model Updates

no code implementations • 3 Jul 2021 • Zhuohang Li, Luyang Liu, Jiaxin Zhang, Jian Liu

Federated Learning (FL) enables multiple distributed clients (e. g., mobile devices) to collaboratively train a centralized model while keeping the training data locally on the client.

Federated Learning

Paper
Add Code

AutoNF: Automated Architecture Optimization of Normalizing Flows Using a Mixture Distribution Formulation

no code implementations • 29 Sep 2021 • Yu Wang, Jan Drgona, Jiaxin Zhang, Karthik Somayaji NS, Frank Y Liu, Malachi Schram, Peng Li

Although various flow models based on different transformations have been proposed, there still lacks a quantitative analysis of performance-cost trade-offs between different flows as well as a systematic way of constructing the best flow architecture.

Paper
Add Code

On the Stochastic Stability of Deep Markov Models

no code implementations • NeurIPS 2021 • Ján Drgoňa, Sayak Mukherjee, Jiaxin Zhang, Frank Liu, Mahantesh Halappanavar

Deep Markov models (DMM) are generative models that are scalable and expressive generalization of Markov models for representation, learning, and inference problems.

Representation Learning

Paper
Add Code

Simulation Intelligence: Towards a New Generation of Scientific Methods

no code implementations • 6 Dec 2021 • Alexander Lavin, David Krakauer, Hector Zenil, Justin Gottschlich, Tim Mattson, Johann Brehmer, Anima Anandkumar, Sanjay Choudry, Kamil Rocki, Atılım Güneş Baydin, Carina Prunkl, Brooks Paige, Olexandr Isayev, Erik Peterson, Peter L. McMahon, Jakob Macke, Kyle Cranmer, Jiaxin Zhang, Haruko Wainwright, Adi Hanuka, Manuela Veloso, Samuel Assefa, Stephan Zheng, Avi Pfeffer

We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simulation, and artificial intelligence.

Probabilistic Programming

Paper
Add Code

Road-aware Monocular Structure from Motion and Homography Estimation

no code implementations • 16 Dec 2021 • Wei Sui, Teng Chen, Jiaxin Zhang, Jiao Lu, Qian Zhang

The Depth-CNN and Pose-CNN estimate dense depth map and ego-motion respectively, solving SFM, while the Pose-CNN and Ground-CNN followed by a homography layer solve the ground plane estimation problem.

Autonomous Driving Homography Estimation +1

Paper
Add Code

Speech Privacy Leakage from Shared Gradients in Distributed Learning

no code implementations • 21 Feb 2023 • Zhuohang Li, Jiaxin Zhang, Jian Liu

Distributed machine learning paradigms, such as federated learning, have been recently adopted in many privacy-critical applications for speech analysis.

Federated Learning Keyword Spotting

Paper
Add Code

RecUP-FL: Reconciling Utility and Privacy in Federated Learning via User-configurable Privacy Defense

no code implementations • 11 Apr 2023 • Yue Cui, Syed Irfan Ali Meerza, Zhuohang Li, Luyang Liu, Jiaxin Zhang, Jian Liu

In this paper, we seek to reconcile utility and privacy in FL by proposing a user-configurable privacy defense, RecUP-FL, that can better focus on the user-specified sensitive attributes while obtaining significant improvements in utility over traditional defenses.

Adversarial Attack Attribute +4

Paper
Add Code

M$^{6}$Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis

no code implementations • 15 May 2023 • Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin

Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion.

Document Layout Analysis document understanding +3

Paper
Add Code

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

no code implementations • 9 Jun 2023 • Jiaxin Zhang, Bangdong Chen, Hiuyi Cheng, Fengjun Guo, Kai Ding, Lianwen Jin

Furthermore, considering the importance of fine-grained elements in document images, we present a details recurrent refinement module to enhance the output in a high-resolution space.

Self-Supervised Learning

Paper
Add Code

PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback

no code implementations • 27 Jul 2023 • Bo Shen, Jiaxin Zhang, Taihong Chen, Daoguang Zan, Bing Geng, An Fu, Muhan Zeng, Ailun Yu, Jichuan Ji, Jingyang Zhao, Yuenan Guo, Qianxiang Wang

In this paper, we propose a novel RRTF (Rank Responses to align Test&Teacher Feedback) framework, which can effectively and efficiently boost pre-trained large language models for code generation.

Ranked #32 on Code Generation on HumanEval

Code Generation

Paper
Add Code

On the Quantification of Image Reconstruction Uncertainty without Training Data

no code implementations • 16 Nov 2023 • Sirui Bi, Victor Fung, Jiaxin Zhang

This, in turn, facilitates a probabilistic interpretation of observational data for decision-making.

Decision Making Image Reconstruction

Paper
Add Code

DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

no code implementations • 16 Nov 2023 • Jiaxin Zhang, Joy Rimchala, Lalla Mouatadid, Kamalika Das, Sricharan Kumar

The performance of optical character recognition (OCR) heavily relies on document image quality, which is crucial for automatic document processing and document intelligence.

Data Augmentation Denoising +5

Paper
Add Code

Advancing Urban Renewal: An Automated Approach to Generating Historical Arcade Facades with Stable Diffusion Models

no code implementations • 20 Nov 2023 • Zheyuan Kuang, Jiaxin Zhang, Yiying Huang, Yunqin Li

Urban renewal and transformation processes necessitate the preservation of the historical urban fabric, particularly in districts known for their architectural and historical significance.

Image Generation

Paper
Add Code

UPOCR: Towards Unified Pixel-Level OCR Interface

no code implementations • 5 Dec 2023 • Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen Jin

Without bells and whistles, the experimental results showcase that the proposed method can simultaneously achieve state-of-the-art performance on three tasks with a unified single model, which provides valuable strategies and insights for future research on generalist OCR models.

Optical Character Recognition Optical Character Recognition (OCR) +2

Paper
Add Code

GLOCALFAIR: Jointly Improving Global and Local Group Fairness in Federated Learning

no code implementations • 7 Jan 2024 • Syed Irfan Ali Meerza, Luyang Liu, Jiaxin Zhang, Jian Liu

Specifically, we utilize constrained optimization to enforce local fairness on the client side and adopt a fairness-aware clustering-based aggregation on the server to further ensure the global model fairness across different sensitive groups while maintaining high utility.

Fairness Federated Learning

Paper
Add Code

GAPS: Geometry-Aware Problem Solver

no code implementations • 29 Jan 2024 • Jiaxin Zhang, Yinghui Jiang, Yashar Moshfeghi

By leveraging these improvements, GAPS showcases remarkable performance in resolving geometry math problems.

Geometry Problem Solving Math

Paper
Add Code

PhaseEvo: Towards Unified In-Context Prompt Optimization for Large Language Models

no code implementations • 17 Feb 2024 • Wendi Cui, Jiaxin Zhang, Zhuohang Li, Hao Sun, Damien Lopez, Kamalika Das, Bradley Malin, Sricharan Kumar

Crafting an ideal prompt for Large Language Models (LLMs) is a challenging task that demands significant resources and expert human input.

Computational Efficiency In-Context Learning

Paper
Add Code

Discriminant Distance-Aware Representation on Deterministic Uncertainty Quantification Methods

no code implementations • 20 Feb 2024 • Jiaxin Zhang, Kamalika Das, Sricharan Kumar

Uncertainty estimation is a crucial aspect of deploying dependable deep learning models in safety-critical systems.

Uncertainty Quantification

Paper
Add Code

SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models

no code implementations • 4 Mar 2024 • Xiang Gao, Jiaxin Zhang, Lalla Mouatadid, Kamalika Das

Motivated by this gap, we introduce a novel UQ method, sampling with perturbation for UQ (SPUQ), designed to tackle both aleatoric and epistemic uncertainties.

Text Generation Uncertainty Quantification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.