Search Results for author: Jiaxin Zhang

Found 49 papers, 21 papers with code

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

3 code implementations ICCV 2023 Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin

To this end, we introduce a new model named Explicit Synergy-based Text Spotting Transformer framework (ESTextSpotter), which achieves explicit synergy by modeling discriminative and interactive features for text detection and recognition within a single decoder.

Text Detection Text Spotting

SPTS: Single-Point Text Spotting

1 code implementation15 Dec 2021 Dezhi Peng, Xinyu Wang, Yuliang Liu, Jiaxin Zhang, Mingxin Huang, Songxuan Lai, Shenggao Zhu, Jing Li, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin

For the first time, we demonstrate that training scene text spotting models can be achieved with an extremely low-cost annotation of a single-point for each instance.

Language Modelling Text Detection +1

SPTS v2: Single-Point Scene Text Spotting

3 code implementations4 Jan 2023 Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin

Within the context of our SPTS v2 framework, our experiments suggest a potential preference for single-point representation in scene text spotting when compared to other representations.

Text Detection Text Spotting

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

1 code implementation24 Jan 2021 Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education.

3D Feature Matching document understanding +2

Towards Accurate Ground Plane Normal Estimation from Ego-Motion

1 code implementation8 Dec 2022 Jiaxin Zhang, Wei Sui, Qian Zhang, Tao Chen, Cong Yang

In this paper, we introduce a novel approach for ground plane normal estimation of wheeled vehicles.

3D Object Detection Autonomous Driving +3

Inverse design of two-dimensional materials with invertible neural networks

1 code implementation6 Jun 2021 Victor Fung, Jiaxin Zhang, Guoxiang Hu, P. Ganesh, Bobby G. Sumpter

The ability to readily design novel materials with chosen functional properties on-demand represents a next frontier in materials discovery.

Band Gap Vocal Bursts Valence Prediction

Accelerating Inverse Learning via Intelligent Localization with Exploratory Sampling

1 code implementation2 Dec 2022 Jiaxin Zhang, Sirui Bi, Victor Fung

In the scope of "AI for Science", solving inverse problems is a longstanding challenge in materials and drug discovery, where the goal is to determine the hidden structures given a set of desirable properties.

Drug Discovery

Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage

1 code implementation CVPR 2022 Zhuohang Li, Jiaxin Zhang, Luyang Liu, Jian Liu

Federated Learning (FL) framework brings privacy benefits to distributed learning systems by allowing multiple clients to participate in a learning task under the coordination of a central server without exchanging their private data.

Bayesian Optimization Federated Learning

ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler

1 code implementation18 Oct 2022 Jiaxin Zhang, Yashar Moshfeghi

Numerical reasoning over text is a challenging task of Artificial Intelligence (AI), requiring reading comprehension and numerical reasoning abilities.

Math Word Problem Solving Question Answering +1

VRSO: Visual-Centric Reconstruction for Static Object Annotation

1 code implementation22 Mar 2024 Chenyao Yu, Yingfeng Cai, Jiaxin Zhang, Hui Kong, Wei Sui, Cong Yang

As a part of the perception results of intelligent driving systems, static object detection (SOD) in 3D space provides crucial cues for driving environment understanding.

Object object-detection +1

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models

1 code implementation4 Jan 2024 Wendi Cui, Jiaxin Zhang, Zhuohang Li, Lopez Damien, Kamalika Das, Bradley Malin, Sricharan Kumar

Evaluating the quality and variability of text generated by Large Language Models (LLMs) poses a significant, yet unresolved research challenge.

Hallucination Sentence

Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild

1 code implementation23 Jul 2022 Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding

To address this issue, we propose a novel approach called Marior (Margin Removal and \Iterative Content Rectification).

Optical Character Recognition (OCR)

Atomic structure generation from reconstructing structural fingerprints

1 code implementation27 Jul 2022 Victor Fung, Shuyi Jia, Jiaxin Zhang, Sirui Bi, Junqi Yin, P. Ganesh

These methods would help identify or, in the case of generative models, even create novel crystal structures of materials with a set of specified functional properties to then be synthesized or isolated in the laboratory.

BIG-bench Machine Learning valid

A Novel Evolution Strategy with Directional Gaussian Smoothing for Blackbox Optimization

1 code implementation7 Feb 2020 Jiaxin Zhang, Hoang Tran, Dan Lu, Guannan Zhang

Standard ES methods with $d$-dimensional Gaussian smoothing suffer from the curse of dimensionality due to the high variance of Monte Carlo (MC) based gradient estimators.

Normality Learning-based Graph Anomaly Detection via Multi-Scale Contrastive Learning

1 code implementation12 Sep 2023 Jingcan Duan, Pei Zhang, Siwei Wang, Jingtao Hu, Hu Jin, Jiaxin Zhang, Haifang Zhou, Xinwang Liu

Finally, the model is refined with the only input of reliable normal nodes and learns a more accurate estimate of normality so that anomalous nodes can be more easily distinguished.

Contrastive Learning Graph Anomaly Detection

GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving

1 code implementation15 Feb 2024 Jiaxin Zhang, Zhongzhi Li, Mingliang Zhang, Fei Yin, ChengLin Liu, Yashar Moshfeghi

Yet, their proficiency in tackling geometry math problems, which necessitates an integrated understanding of both textual and visual information, has not been thoroughly evaluated.

Geometry Problem Solving Math

PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence

no code implementations26 Feb 2018 Jinglan Liu, Jiaxin Zhang, Yukun Ding, Xiaowei Xu, Meng Jiang, Yiyu Shi

This work explores the binarization of the deconvolution-based generator in a GAN for memory saving and speedup of image construction.

Binarization

Power-Law Graph Cuts

no code implementations29 Oct 2014 Xiangyang Zhou, Jiaxin Zhang, Brian Kulis

Despite strong performance for a number of clustering tasks, spectral graph cut algorithms still suffer from several limitations: first, they require the number of clusters to be known in advance, but this information is often unknown a priori; second, they tend to produce clusters with uniform sizes.

Clustering Image Segmentation +1

Robust data-driven approach for predicting the configurational energy of high entropy alloys

no code implementations10 Aug 2019 Jiaxin Zhang, Xianglin Liu, Sirui Bi, Junqi Yin, Guannan Zhang, Markus Eisenbach

In this study, a robust data-driven framework based on Bayesian approaches is proposed and demonstrated on the accurate and efficient prediction of configurational energy of high entropy alloys.

feature selection Small Data Image Classification

Learning nonlinear level sets for dimensionality reduction in function approximation

no code implementations NeurIPS 2019 Guannan Zhang, Jiaxin Zhang, Jacob Hinkle

We developed a Nonlinear Level-set Learning (NLL) method for dimensionality reduction in high-dimensional function approximation with small data.

Functional Analysis

Scalable Deep-Learning-Accelerated Topology Optimization for Additively Manufactured Materials

no code implementations28 Nov 2020 Sirui Bi, Jiaxin Zhang, Guannan Zhang

Unlike the existing studies of DL for TO, our framework accelerates TO by learning the iterative history data and simultaneously training on the mapping between the given design and its gradient.

Thermodynamic Consistent Neural Networks for Learning Material Interfacial Mechanics

no code implementations28 Nov 2020 Jiaxin Zhang, Congjie Wei, Chenglin Wu

In this paper, we propose a thermodynamic consistent neural network (TCNN) approach to build a data-driven model of the TSR with sparse experimental data.

A Scalable Gradient-Free Method for Bayesian Experimental Design with Implicit Models

no code implementations14 Mar 2021 Jiaxin Zhang, Sirui Bi, Guannan Zhang

However, the approach requires a sampling path to compute the pathwise gradient of the MI lower bound with respect to the design variables, and such a pathwise gradient is usually inaccessible for implicit models.

Experimental Design

A Hybrid Gradient Method to Designing Bayesian Experiments for Implicit Models

no code implementations14 Mar 2021 Jiaxin Zhang, Sirui Bi, Guannan Zhang

However, the approach in Kleinegesse et al., 2020 requires a pathwise sampling path to compute the gradient of the MI lower bound with respect to the design variables, and such a pathwise sampling path is usually inaccessible for implicit models.

Experimental Design

Deep Online Correction for Monocular Visual Odometry

no code implementations18 Mar 2021 Jiaxin Zhang, Wei Sui, Xinggang Wang, Wenming Meng, Hongmei Zhu, Qian Zhang

Second, the poses predicted by CNNs are further improved by minimizing photometric errors via gradient updates of poses during inference phases.

Monocular Visual Odometry RTE

Byzantine-robust Federated Learning through Spatial-temporal Analysis of Local Model Updates

no code implementations3 Jul 2021 Zhuohang Li, Luyang Liu, Jiaxin Zhang, Jian Liu

Federated Learning (FL) enables multiple distributed clients (e. g., mobile devices) to collaboratively train a centralized model while keeping the training data locally on the client.

Federated Learning

AutoNF: Automated Architecture Optimization of Normalizing Flows Using a Mixture Distribution Formulation

no code implementations29 Sep 2021 Yu Wang, Jan Drgona, Jiaxin Zhang, Karthik Somayaji NS, Frank Y Liu, Malachi Schram, Peng Li

Although various flow models based on different transformations have been proposed, there still lacks a quantitative analysis of performance-cost trade-offs between different flows as well as a systematic way of constructing the best flow architecture.

On the Stochastic Stability of Deep Markov Models

no code implementations NeurIPS 2021 Ján Drgoňa, Sayak Mukherjee, Jiaxin Zhang, Frank Liu, Mahantesh Halappanavar

Deep Markov models (DMM) are generative models that are scalable and expressive generalization of Markov models for representation, learning, and inference problems.

Representation Learning

Road-aware Monocular Structure from Motion and Homography Estimation

no code implementations16 Dec 2021 Wei Sui, Teng Chen, Jiaxin Zhang, Jiao Lu, Qian Zhang

The Depth-CNN and Pose-CNN estimate dense depth map and ego-motion respectively, solving SFM, while the Pose-CNN and Ground-CNN followed by a homography layer solve the ground plane estimation problem.

Autonomous Driving Homography Estimation +1

Speech Privacy Leakage from Shared Gradients in Distributed Learning

no code implementations21 Feb 2023 Zhuohang Li, Jiaxin Zhang, Jian Liu

Distributed machine learning paradigms, such as federated learning, have been recently adopted in many privacy-critical applications for speech analysis.

Federated Learning Keyword Spotting

RecUP-FL: Reconciling Utility and Privacy in Federated Learning via User-configurable Privacy Defense

no code implementations11 Apr 2023 Yue Cui, Syed Irfan Ali Meerza, Zhuohang Li, Luyang Liu, Jiaxin Zhang, Jian Liu

In this paper, we seek to reconcile utility and privacy in FL by proposing a user-configurable privacy defense, RecUP-FL, that can better focus on the user-specified sensitive attributes while obtaining significant improvements in utility over traditional defenses.

Adversarial Attack Attribute +4

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

no code implementations9 Jun 2023 Jiaxin Zhang, Bangdong Chen, Hiuyi Cheng, Fengjun Guo, Kai Ding, Lianwen Jin

Furthermore, considering the importance of fine-grained elements in document images, we present a details recurrent refinement module to enhance the output in a high-resolution space.

Self-Supervised Learning

PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback

no code implementations27 Jul 2023 Bo Shen, Jiaxin Zhang, Taihong Chen, Daoguang Zan, Bing Geng, An Fu, Muhan Zeng, Ailun Yu, Jichuan Ji, Jingyang Zhao, Yuenan Guo, Qianxiang Wang

In this paper, we propose a novel RRTF (Rank Responses to align Test&Teacher Feedback) framework, which can effectively and efficiently boost pre-trained large language models for code generation.

Code Generation

On the Quantification of Image Reconstruction Uncertainty without Training Data

no code implementations16 Nov 2023 Sirui Bi, Victor Fung, Jiaxin Zhang

This, in turn, facilitates a probabilistic interpretation of observational data for decision-making.

Decision Making Image Reconstruction

DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

no code implementations16 Nov 2023 Jiaxin Zhang, Joy Rimchala, Lalla Mouatadid, Kamalika Das, Sricharan Kumar

The performance of optical character recognition (OCR) heavily relies on document image quality, which is crucial for automatic document processing and document intelligence.

Data Augmentation Denoising +5

Advancing Urban Renewal: An Automated Approach to Generating Historical Arcade Facades with Stable Diffusion Models

no code implementations20 Nov 2023 Zheyuan Kuang, Jiaxin Zhang, Yiying Huang, Yunqin Li

Urban renewal and transformation processes necessitate the preservation of the historical urban fabric, particularly in districts known for their architectural and historical significance.

Image Generation

UPOCR: Towards Unified Pixel-Level OCR Interface

no code implementations5 Dec 2023 Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen Jin

Without bells and whistles, the experimental results showcase that the proposed method can simultaneously achieve state-of-the-art performance on three tasks with a unified single model, which provides valuable strategies and insights for future research on generalist OCR models.

Optical Character Recognition Optical Character Recognition (OCR) +2

GLOCALFAIR: Jointly Improving Global and Local Group Fairness in Federated Learning

no code implementations7 Jan 2024 Syed Irfan Ali Meerza, Luyang Liu, Jiaxin Zhang, Jian Liu

Specifically, we utilize constrained optimization to enforce local fairness on the client side and adopt a fairness-aware clustering-based aggregation on the server to further ensure the global model fairness across different sensitive groups while maintaining high utility.

Fairness Federated Learning

GAPS: Geometry-Aware Problem Solver

no code implementations29 Jan 2024 Jiaxin Zhang, Yinghui Jiang, Yashar Moshfeghi

By leveraging these improvements, GAPS showcases remarkable performance in resolving geometry math problems.

Geometry Problem Solving Math

PhaseEvo: Towards Unified In-Context Prompt Optimization for Large Language Models

no code implementations17 Feb 2024 Wendi Cui, Jiaxin Zhang, Zhuohang Li, Hao Sun, Damien Lopez, Kamalika Das, Bradley Malin, Sricharan Kumar

Crafting an ideal prompt for Large Language Models (LLMs) is a challenging task that demands significant resources and expert human input.

Computational Efficiency In-Context Learning

Discriminant Distance-Aware Representation on Deterministic Uncertainty Quantification Methods

no code implementations20 Feb 2024 Jiaxin Zhang, Kamalika Das, Sricharan Kumar

Uncertainty estimation is a crucial aspect of deploying dependable deep learning models in safety-critical systems.

Uncertainty Quantification

SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models

no code implementations4 Mar 2024 Xiang Gao, Jiaxin Zhang, Lalla Mouatadid, Kamalika Das

Motivated by this gap, we introduce a novel UQ method, sampling with perturbation for UQ (SPUQ), designed to tackle both aleatoric and epistemic uncertainties.

Text Generation Uncertainty Quantification

Cannot find the paper you are looking for? You can Submit a new open access paper.