Search Results for author: Xinyu Zhang

Found 131 papers, 59 papers with code

Implicit Sample Extension for Unsupervised Person Re-Identification

1 code implementation • CVPR 2022 • Xinyu Zhang, Dongdong Li, Zhigang Wang, Jian Wang, Errui Ding, Javen Qinfeng Shi, Zhaoxiang Zhang, Jingdong Wang

Specifically, we generate support samples from actual samples and their neighbouring clusters in the embedding space through a progressive linear interpolation (PLI) strategy.

Clustering Unsupervised Person Re-Identification

5,251

Paper
Code

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices

1 code implementation • 28 Dec 2023 • Xiangxiang Chu, Limeng Qiao, Xinyang Lin, Shuang Xu, Yang Yang, Yiming Hu, Fei Wei, Xinyu Zhang, Bo Zhang, Xiaolin Wei, Chunhua Shen

We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted to run on mobile devices.

AutoML Language Modelling

767

Paper
Code

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

1 code implementation • 6 Feb 2024 • Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen

We introduce MobileVLM V2, a family of significantly improved vision language models upon MobileVLM, which proves that a delicate orchestration of novel architectural design, an improved training scheme tailored for mobile VLMs, and rich high-quality dataset curation can substantially benefit VLMs' performance.

AutoML Language Modelling

767

Paper
Code

Answer Complex Questions: Path Ranker Is All You Need

3 code implementations • Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval 2021 • Xinyu Zhang, Ke Zhan, Enrui Hu, Chengzhen Fu, Lan Luo, Hao Jiang, Yantao Jia, Fan Yu, Zhicheng Dou, Zhao Cao, Lei Chen

Currently, the most popular method for open-domain Question Answering (QA) adopts "Retriever and Reader" pipeline, where the retriever extracts a list of candidate documents from a large set of documents followed by a ranker to rank the most relevant documents and the reader extracts answer from the candidates.

Open-Domain Question Answering

334

Paper
Code

Detect Everything with Few Examples

1 code implementation • 22 Sep 2023 • Xinyu Zhang, Yuting Wang, Abdeslam Boularias

DE-ViT establishes new state-of-the-art results on all benchmarks.

Ranked #1 on Few-Shot Object Detection on MS-COCO (30-shot)

Binary Classification Few-Shot Object Detection +4

255

Paper
Code

Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation

1 code implementation • CVPR 2021 • Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang

Many recent trackers adopt the multiple-stage tracking strategy to improve the quality of bounding box estimation.

Ranked #15 on Semi-Supervised Video Object Segmentation on VOT2020

Semi-Supervised Video Object Segmentation Visual Object Tracking

188

Paper
Code

BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection

1 code implementation • CVPR 2023 • Lei Yang, Kaicheng Yu, Tao Tang, Jun Li, Kun Yuan, Li Wang, Xinyu Zhang, Peng Chen

In essence, instead of predicting the pixel-wise depth, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

Ranked #3 on 3D Object Detection on Rope3D

3D Object Detection Autonomous Driving +1

173

Paper
Code

Resolve Domain Conflicts for Generalizable Remote Physiological Measurement

2 code implementations • 11 Apr 2024 • Weiyu Sun, Xinyu Zhang, Hao Lu, Ying Chen, Yun Ge, Xiaolin Huang, Jie Yuan, Yingcong Chen

Remote photoplethysmography (rPPG) technology has become increasingly popular due to its non-invasive monitoring of various physiological indicators, making it widely applicable in multimedia interaction, healthcare, and emotion analysis.

Attribute Emotion Recognition +1

139

Paper
Code

Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages

1 code implementation • 18 Oct 2022 • Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin

MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual dataset we have built for the WSDM 2023 Cup challenge that focuses on ad hoc retrieval across 18 different languages, which collectively encompass over three billion native speakers around the world.

Information Retrieval Retrieval

126

Paper
Code

FastPillars: A Deployment-friendly Pillar-based 3D Detector

1 code implementation • 5 Feb 2023 • Sifan Zhou, Zhi Tian, Xiangxiang Chu, Xinyu Zhang, Bo Zhang, Xiaobo Lu, Chengjian Feng, Zequn Jie, Patrick Yin Chiang, Lin Ma

The deployment of 3D detectors strikes one of the major challenges in real-world self-driving scenarios.

3D Object Detection object-detection

121

Paper
Code

Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving

1 code implementation • 11 Oct 2023 • Xinyu Zhang, Li Wang, Jian Chen, Cheng Fang, Lei Yang, Ziying Song, Guangqi Yang, Yichen Wang, Xiaofei Zhang, Jun Li, Zhiwei Li, Qingshan Yang, Zhenlin Zhang, Shuzhi Sam Ge

Compared with commonly used 3D radars, the latest 4D radars have precise vertical resolution and higher point cloud density, making it a highly promising sensor for autonomous driving in complex environmental perception.

3D Object Detection Autonomous Driving +1

101

Paper
Code

Robust Multimodal Vehicle Detection in Foggy Weather Using Complementary Lidar and Radar Signals

1 code implementation • CVPR 2021 • Kun Qian, Shilin Zhu, Xinyu Zhang, Li Erran Li

Vehicle detection with visual sensors like lidar and camera is one of the critical functions enabling autonomous driving.

Autonomous Driving

Paper
Code

Lenna: Language Enhanced Reasoning Detection Assistant

1 code implementation • 5 Dec 2023 • Fei Wei, Xinyu Zhang, Ailing Zhang, Bo Zhang, Xiangxiang Chu

To evaluate the reasoning capability of Lenna, we construct a ReasonDet dataset to measure its performance on reasoning-based detection.

World Knowledge

Paper
Code

Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval

1 code implementation • EMNLP (MRL) 2021 • Xinyu Zhang, Xueguang Ma, Peng Shi, Jimmy Lin

We present Mr. TyDi, a multi-lingual benchmark dataset for mono-lingual retrieval in eleven typologically diverse languages, designed to evaluate ranking with learned dense representations.

Representation Learning Retrieval

Paper
Code

Transforming the Latent Space of StyleGAN for Real Face Editing

1 code implementation • 29 May 2021 • Heyi Li, Jinlong Liu, Xinyu Zhang, Yunzhi Bai, Huayan Wang, Klaus Mueller

But more importantly, the proposed $W$++ space achieves superior performance in both reconstruction quality and editing quality.

Paper
Code

WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus

1 code implementation • 10 Apr 2023 • Hongjing Qian, Yutao Zhu, Zhicheng Dou, Haoqi Gu, Xinyu Zhang, Zheng Liu, Ruofei Lai, Zhao Cao, Jian-Yun Nie, Ji-Rong Wen

In this paper, we introduce a new NLP task -- generating short factual articles with references for queries by mining supporting evidence from the Web.

Retrieval Text Generation

Paper
Code

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline

1 code implementation • NAACL 2022 • Xiangyang Liu, Tianxiang Sun, Junliang He, Jiawen Wu, Lingling Wu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu

ELUE is dedicated to depict the Pareto Frontier for various language understanding tasks, such that it can tell whether and how much a method achieves Pareto improvement.

Paper
Code

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

1 code implementation • ACL 2022 • Jiawei Zhou, Xiaoguang Li, Lifeng Shang, Lan Luo, Ke Zhan, Enrui Hu, Xinyu Zhang, Hao Jiang, Zhao Cao, Fan Yu, Xin Jiang, Qun Liu, Lei Chen

To alleviate the data scarcity problem in training question answering systems, recent works propose additional intermediate pre-training for dense passage retrieval (DPR).

Open-Domain Question Answering Passage Retrieval +1

Paper
Code

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection

1 code implementation • 29 Jan 2024 • Sifan Zhou, Liang Li, Xinyu Zhang, Bo Zhang, Shipeng Bai, Miao Sun, Ziyu Zhao, Xiaobo Lu, Xiangxiang Chu

To our knowledge, for the very first time in lidar-based 3D detection tasks, the PTQ INT8 model's accuracy is almost the same as the FP32 model while enjoying $3\times$ inference speedup.

3D Object Detection Autonomous Vehicles +3

Paper
Code

CMRxRecon: An open cardiac MRI dataset for the competition of accelerated image reconstruction

1 code implementation • 19 Sep 2023 • Chengyan Wang, Jun Lyu, Shuo Wang, Chen Qin, Kunyuan Guo, Xinyu Zhang, Xiaotong Yu, Yan Li, Fanwen Wang, Jianhua Jin, Zhang Shi, Ziqiang Xu, Yapeng Tian, Sha Hua, Zhensen Chen, Meng Liu, Mengting Sun, Xutong Kuang, Kang Wang, Haoran Wang, Hao Li, Yinghua Chu, Guang Yang, Wenjia Bai, Xiahai Zhuang, He Wang, Jing Qin, Xiaobo Qu

However, a limitation of CMR is its slow imaging speed, which causes patient discomfort and introduces artifacts in the images.

Image Reconstruction

Paper
Code

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

1 code implementation • 10 Jul 2022 • Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Chuang Zhang, Jun Li

Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4. 65% improvement on AP@0. 7 for car detection, reaching 18. 54% AP@0. 7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard.

Autonomous Driving Model Optimization +2

Paper
Code

Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations

2 code implementations • 15 Oct 2021 • Xinyu Zhang, Ian Colbert, Ken Kreutz-Delgado, Srinjoy Das

State-of-the-art quantization techniques are currently applied to both the weights and activations; however, pruning is most often applied to only the weights of the network.

Network Pruning Quantization

Paper
Code

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

1 code implementation • NeurIPS 2023 • Junkun Yuan, Xinyu Zhang, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, Jingdong Wang

To further capture human characteristics, we propose a structure-invariant alignment loss that enforces different masked views, guided by the human part prior, to be closely aligned for the same image.

2D Pose Estimation Attribute +3

Paper
Code

Learning Granularity-Unified Representations for Text-to-Image Person Re-identification

2 code implementations • 16 Jul 2022 • Zhiyin Shao, Xinyu Zhang, Meng Fang, Zhifeng Lin, Jian Wang, Changxing Ding

In PGU, we adopt a set of shared and learnable prototypes as the queries to extract diverse and semantically aligned features for both modalities in the granularity-unified feature space, which further promotes the ReID performance.

Person Re-Identification Text based Person Retrieval +1

Paper
Code

Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face

1 code implementation • 28 Feb 2023 • Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast

We present Spacerini, a tool that integrates the Pyserini toolkit for reproducible information retrieval research with Hugging Face to enable the seamless construction and deployment of interactive search engines.

Information Retrieval Retrieval

Paper
Code

Lite-FPN for Keypoint-based Monocular 3D Object Detection

1 code implementation • 1 May 2021 • Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Jun Li

3D object detection with a single image is an essential and challenging task for autonomous driving.

Autonomous Driving Monocular 3D Object Detection +2

Paper
Code

HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution

1 code implementation • 31 Jul 2023 • Ehsan Kamalloo, Aref Jafari, Xinyu Zhang, Nandan Thakur, Jimmy Lin

In this paper, we introduce a new dataset, HAGRID (Human-in-the-loop Attributable Generative Retrieval for Information-seeking Dataset) for building end-to-end generative information-seeking models that are capable of retrieving candidate quotes and generating attributed explanations.

Information Retrieval Informativeness +1

Paper
Code

Unified Pre-training with Pseudo Texts for Text-To-Image Person Re-identification

1 code implementation • ICCV 2023 • Zhiyin Shao, Xinyu Zhang, Changxing Ding, Jian Wang, Jingdong Wang

In this way, the pre-training task and the T2I-ReID task are made consistent with each other on both data and training levels.

Person Re-Identification

Paper
Code

UFO: Unified Feature Optimization

1 code implementation • 21 Jul 2022 • Teng Xi, Yifan Sun, Deli Yu, Bi Li, Nan Peng, Gang Zhang, Xinyu Zhang, Zhigang Wang, Jinwen Chen, Jian Wang, Lufei Liu, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang

UFO aims to benefit each single task with a large-scale pretraining on all tasks.

Face Recognition Multi-Task Learning +4

Paper
Code

Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification

1 code implementation • 28 Jun 2023 • Pei Liu, Luping Ji, Xinyu Zhang, Feng Ye

Experimental results show that PseMix could often assist state-of-the-art MIL networks to refresh their classification performance on WSIs.

Classification Data Augmentation +3

Paper
Code

Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking

1 code implementation • 24 Aug 2021 • Yutao Zhu, Jian-Yun Nie, Zhicheng Dou, Zhengyi Ma, Xinyu Zhang, Pan Du, Xiaochen Zuo, Hao Jiang

To learn a more robust representation of the user behavior sequence, we propose a method based on contrastive learning, which takes into account the possible variations in user's behavior sequences.

Contrastive Learning Data Augmentation +1

Paper
Code

VRP-SAM: SAM with Visual Reference Prompt

1 code implementation • 27 Feb 2024 • Yanpeng Sun, Jiahui Chen, Shan Zhang, Xinyu Zhang, Qiang Chen, Gang Zhang, Errui Ding, Jingdong Wang, Zechao Li

In this paper, we propose a novel Visual Reference Prompt (VRP) encoder that empowers the Segment Anything Model (SAM) to utilize annotated reference images as prompts for segmentation, creating the VRP-SAM model.

Meta-Learning Segmentation

Paper
Code

GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration

1 code implementation • 2 Jun 2023 • Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin

We discuss how Pyserini - a widely used toolkit for reproducible IR research can be integrated with the Hugging Face ecosystem of open-source AI libraries and artifacts.

Information Retrieval Retrieval

Paper
Code

Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need

1 code implementation • 20 Aug 2021 • Zhengyi Ma, Zhicheng Dou, Wei Xu, Xinyu Zhang, Hao Jiang, Zhao Cao, Ji-Rong Wen

In this paper, we propose to leverage the large-scale hyperlinks and anchor texts to pre-train the language model for ad-hoc retrieval.

Language Modelling Retrieval

Paper
Code

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models

1 code implementation • 11 Oct 2023 • Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture

Large language models (LLMs) exhibit positional bias in how they use context, which especially complicates listwise ranking.

Paper
Code

From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking

1 code implementation • 22 Aug 2022 • Yutao Zhu, Jian-Yun Nie, Yixuan Su, Haonan Chen, Xinyu Zhang, Zhicheng Dou

In this work, we propose a curriculum learning framework for context-aware document ranking, in which the ranking model learns matching signals between the search context and the candidate document in an easy-to-hard manner.

Document Ranking

Paper
Code

S-AT GCN: Spatial-Attention Graph Convolution Network based Feature Enhancement for 3D Object Detection

2 code implementations • 15 Mar 2021 • Li Wang, Chenfei Wang, Xinyu Zhang, Tianwei Lan, Jun Li

3D object detection plays a crucial role in environmental perception for autonomous vehicles, which is the prerequisite of decision and control.

3D Object Detection Autonomous Vehicles +1

Paper
Code

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation

1 code implementation • 18 Dec 2023 • Nandan Thakur, Luiz Bonifacio, Xinyu Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin

We measure LLM robustness using two metrics: (i) hallucination rate, measuring model tendency to hallucinate an answer, when the answer is not present in passages in the non-relevant subset, and (ii) error rate, measuring model inaccuracy to recognize relevant passages in the relevant subset.

Hallucination Language Modelling +2

Paper
Code

Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation

1 code implementation • 17 Mar 2023 • Dongsheng Wang, Xu Jia, Yang Zhang, Xinyu Zhang, Yaoyuan Wang, Ziyang Zhang, Dong Wang, Huchuan Lu

To fully exploit information with event streams to detect objects, a dual-memory aggregation network (DMANet) is proposed to leverage both long and short memory along event streams to aggregate effective information for object detection.

Object object-detection +1

Paper
Code

Optical Flow boosts Unsupervised Localization and Segmentation

1 code implementation • 25 Jul 2023 • Xinyu Zhang, Abdeslam Boularias

Our fine-tuning procedure outperforms state-of-the-art techniques for unsupervised semantic segmentation through linear probing, without the use of any labeled data.

Object Optical Flow Estimation +3

Paper
Code

EFLNet: Enhancing Feature Learning for Infrared Small Target Detection

1 code implementation • 27 Jul 2023 • Bo Yang, Xinyu Zhang, Jian Zhang, Jun Luo, Mingliang Zhou, Yangjun Pi

To address this problem, we propose a new adaptive threshold focal loss (ATFL) function that decouples the target and the background, and utilizes the adaptive mechanism to adjust the loss weight to force the model to allocate more attention to target features.

regression

Paper
Code

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection

1 code implementation • 29 Jan 2024 • Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang shen

Our method surpasses all previous methods by a significant margin in new scenes, including +42. 57% for vehicle, +5. 87% for pedestrian, and +14. 89% for cyclist compared to BEVHeight on the DAIR-V2X-I heterologous benchmark.

3D Object Detection Autonomous Vehicles +1

Paper
Code

Rethinking Label Smoothing on Multi-hop Question Answering

2 code implementations • 19 Dec 2022 • Zhangyue Yin, Yuxin Wang, Xiannian Hu, Yiguang Wu, Hang Yan, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Xipeng Qiu

Multi-Hop Question Answering (MHQA) is a significant area in question answering, requiring multiple reasoning components, including document retrieval, supporting sentence prediction, and answer span extraction.

Image Classification Machine Reading Comprehension +6

Paper
Code

How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement

1 code implementation • 3 Mar 2023 • Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu

In this paper, we propose a novel active imitation learning framework based on a teacher-student interaction model, in which the teacher's goal is to identify the best teaching behavior and actively affect the student's learning process.

Atari Games Imitation Learning

Paper
Code

Neural Image Re-Exposure

1 code implementation • 23 May 2023 • Xinyu Zhang, Hefei Huang, Xu Jia, Dong Wang, Huchuan Lu

In this work, we aim to re-expose the captured photo in post-processing to provide a more flexible way of addressing those issues within a unified framework.

Ranked #4 on Deblurring on GoPro (using extra training data)

Deblurring Joint Deblur and Frame Interpolation +5

Paper
Code

FLTracer: Accurate Poisoning Attack Provenance in Federated Learning

1 code implementation • 20 Oct 2023 • Xinyu Zhang, Qingyu Liu, Zhongjie Ba, Yuan Hong, Tianhang Zheng, Feng Lin, Li Lu, Kui Ren

In this paper, we first conduct a comprehensive study on prior FL attacks and detection methods.

Anomaly Detection Federated Learning

Paper
Code

Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM

1 code implementation • 12 Mar 2024 • Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei

How can we construct an automated debate judge to evaluate an extensive, vibrant, multi-turn debate?

Paper
Code

ROFusion: Efficient Object Detection using Hybrid Point-wise Radar-Optical Fusion

1 code implementation • 17 Jul 2023 • Liu Liu, Shuaifeng Zhi, Zhenhua Du, Li Liu, Xinyu Zhang, Kai Huo, Weidong Jiang

In this paper, we propose a hybrid point-wise Radar-Optical fusion approach for object detection in autonomous driving scenarios.

Autonomous Driving Object +3

Paper
Code

GCN-MIF: Graph Convolutional Network with Multi-Information Fusion for Low-dose CT Denoising

3 code implementations • 15 May 2021 • Kecheng Chen, Jiayu Sun, Jiang Shen, Jixiang Luo, Xinyu Zhang, Xuelin Pan, Dongsheng Wu, Yue Zhao, Miguel Bento, Yazhou Ren, Xiaorong Pu

To address this issue, we propose a novel graph convolutional network-based LDCT denoising model, namely GCN-MIF, to explicitly perform multi-information fusion for denoising purpose.

Denoising

Paper
Code

MTCSNN: Multi-task Clinical Siamese Neural Network for Diabetic Retinopathy Severity Prediction

1 code implementation • 14 Aug 2022 • Chao Feng, Jui Po Hung, Aishan Li, Jieping Yang, Xinyu Zhang

The novelty of this project is to utilize the ordinal information among labels and add a new regression task, which can help the model learn more discriminative feature embedding for fine-grained classification tasks.

regression severity prediction

Paper
Code

Fast Sparse PCA via Positive Semidefinite Projection for Unsupervised Feature Selection

1 code implementation • 12 Sep 2023 • Junjing Zheng, Xinyu Zhang, Yongxiang Liu, Weidong Jiang, Kai Huo, Li Liu

A standard convex SPCA-based model with PSD constraint for unsupervised feature selection is proposed.

feature selection

Paper
Code

What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations

1 code implementation • 30 Nov 2023 • Raphael Tang, Xinyu Zhang, Jimmy Lin, Ferhan Ture

We propose a logistic Bradley-Terry probe which predicts word pair preferences of LLMs from the words' hidden vectors.

Language Modelling

Paper
Code

Peer attention enhances student learning

1 code implementation • 4 Dec 2023 • Songlin Xu, Dongyin Hu, Ru Wang, Xinyu Zhang

Human visual attention is susceptible to social influences.

Paper
Code

Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

1 code implementation • 21 Dec 2023 • Jiayu Lin, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei

The results show the competitiveness of our proposed framework and evaluator in counter-argument generation tasks.

Sentence

Paper
Code

Generalized Supervised Attention for Text Generation

1 code implementation • Findings (ACL) 2021 • Yixian Liu, Liwen Zhang, Xinyu Zhang, Yong Jiang, Yue Zhang, Kewei Tu

Text Generation

Paper
Code

Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking

1 code implementation • 19 May 2022 • Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin

For example, on MS MARCO Passage v1, our method yields an average candidate set size of 27 out of 1, 000 which increases the reranking speed by about 37 times, while the MRR@10 is greater than a pre-specified value of 0. 38 with about 90% empirical coverage and the empirical baselines fail to provide such guarantee.

Computational Efficiency Information Retrieval +1

Paper
Code

Eloss in the way: A Sensitive Input Quality Metrics for Intelligent Driving

1 code implementation • 2 Feb 2023 • Haobo Yang, Shiyan Zhang, Zhuoyi Yang, Xinyu Zhang

With the increasing complexity of the traffic environment, the importance of safety perception in intelligent driving is growing.

Anomaly Detection

Paper
Code

Hi-ArG: Exploring the Integration of Hierarchical Argumentation Graphs in Language Pretraining

1 code implementation • 1 Dec 2023 • Jingcong Liang, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei

In this paper, we propose the Hierarchical Argumentation Graph (Hi-ArG), a new structure to organize arguments.

Knowledge Graphs

Paper
Code

EduAgent: Generative Student Agents in Learning

1 code implementation • 23 Mar 2024 • Songlin Xu, Xinyu Zhang, Lianhui Qin

Student simulation in online education is important to address dynamic learning behaviors of students with diverse backgrounds.

Paper
Code

A New Window Loss Function for Bone Fracture Detection and Localization in X-ray Images with Point-based Annotation

no code implementations • 7 Dec 2020 • Xinyu Zhang, Yirui Wang, Chi-Tung Cheng, Le Lu, Adam P. Harrison, Jing Xiao, Chien-Hung Liao, Shun Miao

Object detection methods are widely adopted for computer-aided diagnosis using medical images.

Image Classification Object +2

Paper
Add Code

Peer-to-Peer Localization for Single-Antenna Devices

no code implementations • 10 Dec 2020 • Xianan Zhang, Wei Wang, Xuedou Xiao, Hang Yang, Xinyu Zhang, Tao Jiang

In this paper, we present P2PLocate, a peer-to-peer localization system that enables a single-antenna device co-located with a batteryless backscatter tag to localize another single-antenna device with decimeter-level accuracy.

Indoor Localization TAG

Paper
Add Code

Diverse Knowledge Distillation for End-to-End Person Search

no code implementations • 21 Dec 2020 • Xinyu Zhang, Xinlong Wang, Jia-Wang Bian, Chunhua Shen, Mingyu You

Person search aims to localize and identify a specific person from a gallery of images.

Human Detection Knowledge Distillation +1

Paper
Add Code

A novel multimodal fusion network based on a joint coding model for lane line segmentation

no code implementations • 20 Mar 2021 • Zhenhong Zou, Xinyu Zhang, Huaping Liu, Zhiwei Li, Amir Hussain, Jun Li

There has recently been growing interest in utilizing multimodal sensors to achieve robust lane line segmentation.

Paper
Add Code

Emotion Eliciting Machine: Emotion Eliciting Conversation Generation based on Dual Generator

no code implementations • 18 May 2021 • Hao Jiang, Yutao Zhu, Xinyu Zhang, Zhicheng Dou, Pan Du, Te Pi, Yantao Jia

Then we propose a dual encoder-decoder structure to model the generation of responses in both positive and negative side based on the changes of the user's emotion status in the conversation.

Paper
Add Code

Early Exiting with Ensemble Internal Classifiers

no code implementations • 28 May 2021 • Tianxiang Sun, Yunhua Zhou, Xiangyang Liu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu

In this paper, we show that a novel objective function for the training of the ensemble internal classifiers can be naturally induced from the perspective of ensemble learning and information theory.

Ensemble Learning

Paper
Add Code

IPS300+: a Challenging Multimodal Dataset for Intersection Perception System

no code implementations • 5 Jun 2021 • Huanan Wang, Xinyu Zhang, Jun Li, Zhiwei Li, Lei Yang, Shuyue Pan, Yongqiang Deng

Through an IPS (Intersection Perception System) installed at the diagonal of the intersection, this paper proposes a high-quality multimodal dataset for the intersection perception task.

Paper
Add Code

YES SIR!Optimizing Semantic Space of Negatives with Self-Involvement Ranker

no code implementations • 14 Sep 2021 • Ruizhi Pu, Xinyu Zhang, Ruofei Lai, Zikai Guo, Yinxia Zhang, Hao Jiang, Yongkang Wu, Yantao Jia, Zhicheng Dou, Zhao Cao

Finally, supervisory signal in rear compressor is computed based on condition probability and thus can control sample dynamic and further enhance the model performance.

Document Ranking Information Retrieval +1

Paper
Add Code

Tuning Confidence Bound for Stochastic Bandits with Bandit Distance

no code implementations • 6 Oct 2021 • Xinyu Zhang, Srinjoy Das, Ken Kreutz-Delgado

We propose a novel modification of the standard upper confidence bound (UCB) method for the stochastic multi-armed bandit (MAB) problem which tunes the confidence bound of a given bandit based on its distance to others.

Paper
Add Code

Towards More Effective and Economic Sparsely-Activated Model

no code implementations • 14 Oct 2021 • Hao Jiang, Ke Zhan, Jianwei Qu, Yongkang Wu, Zhaoye Fei, Xinyu Zhang, Lei Chen, Zhicheng Dou, Xipeng Qiu, Zikai Guo, Ruofei Lai, Jiawen Wu, Enrui Hu, Yinxia Zhang, Yantao Jia, Fan Yu, Zhao Cao

To increase the number of activated experts without an increase in computational cost, we propose SAM (Switch and Mixture) routing, an efficient hierarchical routing mechanism that activates multiple experts in a same device (GPU).

Paper
Add Code

Bag-of-Words Baselines for Semantic Code Search

no code implementations • ACL (NLP4Prog) 2021 • Xinyu Zhang, Ji Xin, Andrew Yates, Jimmy Lin

The task of semantic code search is to retrieve code snippets from a source code corpus based on an information need expressed in natural language.

Code Search Information Retrieval +2

Paper
Add Code

A Little Bit Is Worse Than None: Ranking with Limited Training Data

no code implementations • EMNLP (sustainlp) 2020 • Xinyu Zhang, Andrew Yates, Jimmy Lin

Researchers have proposed simple yet effective techniques for the retrieval problem based on using BERT as a relevance classifier to rerank initial candidates from keyword search.

Passage Retrieval Retrieval

Paper
Add Code

KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models

no code implementations • 28 Feb 2022 • Daniel Gao, Yantao Jia, Lei LI, Chengzhen Fu, Zhicheng Dou, Hao Jiang, Xinyu Zhang, Lei Chen, Zhao Cao

However, to figure out whether PLMs can be reliable knowledge sources and used as alternative knowledge bases (KBs), we need to further explore some critical features of PLMs.

General Knowledge Memorization +1

Paper
Add Code

Multi-channel deep convolutional neural networks for multi-classifying thyroid disease

no code implementations • 6 Mar 2022 • Xinyu Zhang, Vincent CS. Lee, Jia Rong, James C. Lee, Jiangning Song, Feng Liu

Therefore, this study proposed a novel multi-channel convolutional neural network (CNN) architecture to address the multi-class classification task of thyroid disease.

Benchmarking Binary Classification +2

Paper
Add Code

Towards Best Practices for Training Multilingual Dense Retrieval Models

no code implementations • 5 Apr 2022 • Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin

Dense retrieval models using a transformer-based bi-encoder design have emerged as an active area of research.

Cross-Lingual Transfer Retrieval

Paper
Add Code

Generative Pre-Trained Transformers for Biologically Inspired Design

no code implementations • 31 Mar 2022 • Qihao Zhu, Xinyu Zhang, Jianxi Luo

Biological systems in nature have evolved for millions of years to adapt and survive the environment.

Language Modelling

Paper
Add Code

Replacing Labeled Real-image Datasets with Auto-generated Contours

no code implementations • CVPR 2022 • Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio Yokota

In the present work, we show that the performance of formula-driven supervised learning (FDSL) can match or even exceed that of ImageNet-21k without the use of real images, human-, and self-supervision during the pre-training of Vision Transformers (ViTs).

Paper
Add Code

Evolutionary Game-Theoretical Analysis for General Multiplayer Asymmetric Games

no code implementations • 22 Jun 2022 • Xinyu Zhang, Peng Peng, Yushan Zhou, Haifeng Wang, Wenxin Li

First, there is inaccuracy when analysing the simplified payoff table.

Starcraft Starcraft II

Paper
Add Code

BYHE: A Simple Framework for Boosting End-to-end Video-based Heart Rate Measurement Network

no code implementations • 4 Jul 2022 • Weiyu Sun, Xinyu Zhang, Ying Chen, Yun Ge, Chunyu Ji, Xiaolin Huang

Heart rate measuring based on remote photoplethysmography (rPPG) plays an important role in health caring, which estimates heart rate from facial video in a non-contact, less-constrained way.

Heart rate estimation

Paper
Add Code

Learning-based Practical Smartphone Eavesdropping with Built-in Accelerometer

no code implementations • Network and Distributed Systems Security (NDSS) Symposium 2020 • Zhongjie Ba, Tianhang Zheng, Xinyu Zhang, Zhan Qin, Baochun Li, Xue Liu and Kui Ren

The second limitation comes from a common sense that these sensors can only pick up a narrow band (85-100Hz) of speech signals due to a sampling ceiling of 200Hz.

Common Sense Reasoning

Paper
Add Code

Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding

no code implementations • COLING 2022 • Zhaoye Fei, Yu Tian, Yongkang Wu, Xinyu Zhang, Yutao Zhu, Zheng Liu, Jiawen Wu, Dejiang Kong, Ruofei Lai, Zhao Cao, Zhicheng Dou, Xipeng Qiu

Our experiments on 13 benchmark datasets across five natural language understanding tasks demonstrate the superiority of our method.

Multi-Task Learning Natural Language Understanding

Paper
Add Code

CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking with Camera-LiDAR Fusion

no code implementations • 6 Sep 2022 • Li Wang, Xinyu Zhang, Wenyuan Qin, Xiaoyu Li, Lei Yang, Zhiwei Li, Lei Zhu, Hong Wang, Jun Li, Huaping Liu

As such, we propose a novel camera-LiDAR fusion 3D MOT framework based on the Combined Appearance-Motion Optimization (CAMO-MOT), which uses both camera and LiDAR data and significantly reduces tracking failures caused by occlusion and false detection.

3D Multi-Object Tracking Autonomous Driving +2

Paper
Add Code

Ambiguity Function Shaping based on Alternating Direction Riemannian Optimal Algorithm

no code implementations • 8 Sep 2022 • Haoyu Yi, Xinyu Zhang, Weidong Jiang, Kai Huo

In this paper, we proposed a novel method to design a waveform to synthesize the STAF based on suppressing the interference power.

Paper
Add Code

Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?

no code implementations • 14 Sep 2022 • Jiawen Wu, Xinyu Zhang, Yutao Zhu, Zheng Liu, Zikai Guo, Zhaoye Fei, Ruofei Lai, Yongkang Wu, Zhao Cao, Zhicheng Dou

Hyperlinks, which are commonly used in Web pages, have been leveraged for designing pre-training objectives.

Information Retrieval Question Answering +1

Paper
Add Code

A Hmong Corpus with Elaborate Expression Annotations

no code implementations • LREC 2022 • David R. Mortensen, Xinyu Zhang, Chenxuan Cui, Katherine Zhang

This paper describes the first publicly available corpus of Hmong, a minority language of China, Vietnam, Laos, Thailand, and various countries in Europe and the Americas.

Word Embeddings

Paper
Add Code

Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers

no code implementations • 11 Oct 2022 • Odunayo Ogundepo, Xinyu Zhang, Jimmy Lin

However, only a handful of the 7000+ languages on the planet benefit from specialized, custom-built tokenization algorithms, while the other languages are stuck with a "default" whitespace tokenizer, which cannot capture the intricacies of different languages.

Information Retrieval Retrieval

Paper
Add Code

RF-CHORD: Towards Deployable RFID Localization System for Logistics Network

no code implementations • 1 Nov 2022 • Bo Liang, Purui Wang, Renjie Zhao, Heyu Guo, Pengyu Zhang, Junchen Guo, Shunmin Zhu, Hongqiang Harry Liu, Xinyu Zhang, Chenren Xu

RFID localization is considered the key enabler of automating the process of inventory tracking and management for high-performance logistic network.

Management

Paper
Add Code

3DFill:Reference-guided Image Inpainting by Self-supervised 3D Image Alignment

no code implementations • 9 Nov 2022 • Liang Zhao, Xinyuan Zhao, Hailong Ma, Xinyu Zhang, Long Zeng

We then fill the hole in the target image with the contents of the aligned image.

Image Inpainting

Paper
Add Code

A classification performance evaluation measure considering data separability

no code implementations • 10 Nov 2022 • Lingyan Xue, Xinyu Zhang, Weidong Jiang, Kai Huo

Machine learning and deep learning classification models are data-driven, and the model and the data jointly determine their classification performance.

Classification

Paper
Add Code

CAE v2: Context Autoencoder with CLIP Target

no code implementations • 17 Nov 2022 • Xinyu Zhang, Jiahui Chen, Junkun Yuan, Qiang Chen, Jian Wang, Xiaodi Wang, Shumin Han, Xiaokang Chen, Jimin Pi, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

That is to say, the smaller the model, the lower the mask ratio needs to be.

Semantic Segmentation

Paper
Add Code

Asymptotic Properties of the Synthetic Control Method

no code implementations • 22 Nov 2022 • Xiaomeng Zhang, Wendun Wang, Xinyu Zhang

This paper provides new insights into the asymptotic properties of the synthetic control method (SCM).

Paper
Add Code

Biologically Inspired Design Concept Generation Using Generative Pre-Trained Transformers

no code implementations • 26 Dec 2022 • Qihao Zhu, Xinyu Zhang, Jianxi Luo

This paper proposes a generative design approach based on the generative pre-trained language model (PLM) to automatically retrieve and map biological analogy and generate BID in the form of natural language.

Language Modelling

Paper
Add Code

Modelling human logical reasoning process in dynamic environmental stress with cognitive agents

no code implementations • 15 Jan 2023 • Songlin Xu, Xinyu Zhang

Overall, this work demonstrates a powerful, data-driven methodology to simulate and understand the vagaries of human logical reasoning process in dynamic contexts.

Logical Reasoning reinforcement-learning +2

Paper
Add Code

Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction

no code implementations • 13 Feb 2023 • Xinyu Zhang, Minghan Li, Jimmy Lin

Recent progress in information retrieval finds that embedding query and document representation into multi-vector yields a robust bi-encoder retriever on out-of-distribution datasets.

Information Retrieval Out-of-Distribution Generalization +1

Paper
Add Code

CMG-Net: An End-to-End Contact-Based Multi-Finger Dexterous Grasping Network

no code implementations • 23 Mar 2023 • Mingze Wei, Yaomin Huang, Zhiyuan Xu, Ning Liu, Zhengping Che, Xinyu Zhang, Chaomin Shen, Feifei Feng, Chun Shan, Jian Tang

Our work significantly outperforms the state-of-the-art for three-finger robotic hands.

Paper
Add Code

Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval

no code implementations • 3 Apr 2023 • Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang

The advent of multilingual language models has generated a resurgence of interest in cross-lingual information retrieval (CLIR), which is the task of searching documents in one language with queries from another.

Cross-Lingual Information Retrieval Retrieval

Paper
Add Code

Informative Data Selection with Uncertainty for Multi-modal Object Detection

no code implementations • 23 Apr 2023 • Xinyu Zhang, Zhiwei Li, Zhenhong Zou, Xin Gao, Yijin Xiong, Dafeng Jin, Jun Li, Huaping Liu

To quantify the correlation in multi-modal information, we model the uncertainty, as the inverse of data information, in different modalities and embed it in the bounding box generation.

Informativeness object-detection +1

Paper
Add Code

Zero-Shot Listwise Document Reranking with a Large Language Model

no code implementations • 3 May 2023 • Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin

Supervised ranking methods based on bi-encoder or cross-encoder architectures have shown success in multi-stage text ranking tasks, but they require large amounts of relevance judgments as training data.

Language Modelling Large Language Model +1

Paper
Add Code

Evaluating Embedding APIs for Information Retrieval

no code implementations • 10 May 2023 • Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin

The ever-increasing size of language models curtails their widespread availability to the community, thereby galvanizing many companies into offering access to large language models through APIs.

Domain Generalization Information Retrieval +2

Paper
Add Code

Path Planning for Air-Ground Robot Considering Modal Switching Point Optimization

no code implementations • 14 May 2023 • Xiaoyu Wang, Kangyao Huang, Xinyu Zhang, Honglin Sun, Wenzhuo LIU, Huaping Liu, Jun Li, Pingping Lu

A robot for the field application environment was proposed, and a lightweight global spatial planning technique for the robot based on the graph-search algorithm taking mode switching point optimization into account, with an emphasis on energy efficiency, searching speed, and the viability of real deployment.

Paper
Add Code

Optimal Weighted Random Forests

no code implementations • 17 May 2023 • Xinyu Chen, Dalei Yu, Xinyu Zhang

The random forest (RF) algorithm has become a very popular prediction method for its great flexibility and promising accuracy.

feature selection

Paper
Add Code

Multi-source adversarial transfer learning for ultrasound image segmentation with limited similarity

no code implementations • 30 May 2023 • Yifu Zhang, Hongru Li, Tao Yang, Rui Tao, Zhengyuan Liu, Shimeng Shi, Jiansong Zhang, Ning Ma, Wujin Feng, Zhanhu Zhang, Xinyu Zhang

Transfer learning provides the possibility to solve this problem, but there are too many features in natural images that are not related to the target domain.

Image Segmentation Lesion Segmentation +2

Paper
Add Code

Towards Optimal Neural Networks: the Role of Sample Splitting in Hyperparameter Selection

no code implementations • 15 Jul 2023 • Shijin Gong, Xinyu Zhang

When artificial neural networks have demonstrated exceptional practical success in a variety of domains, investigations into their theoretical characteristics, such as their approximation power, statistical properties, and generalization performance, have concurrently made significant strides.

Paper
Add Code

Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks

no code implementations • 31 Jul 2023 • Xinyu Zhang, Hanbin Hong, Yuan Hong, Peng Huang, Binghui Wang, Zhongjie Ba, Kui Ren

The language models, especially the basic text classification models, have been shown to be susceptible to textual adversarial attacks such as synonym substitution and word insertion attacks.

text-classification Text Classification

Paper
Add Code

SkipcrossNets: Adaptive Skip-cross Fusion for Road Detection

no code implementations • 24 Aug 2023 • Xinyu Zhang, Yan Gong, Zhiwei Li, Xin Gao, Dafeng Jin, Jun Li, Huaping Liu

Multi-modal fusion is increasingly being used for autonomous driving tasks, as images from different modalities provide unique information for feature extraction.

Autonomous Driving

Paper
Add Code

Optimizing Factual Accuracy in Text Generation through Dynamic Knowledge Selection

no code implementations • 30 Aug 2023 • Hongjin Qian, Zhicheng Dou, Jiejun Tan, Haonan Chen, Haoqi Gu, Ruofei Lai, Xinyu Zhang, Zhao Cao, Ji-Rong Wen

Previous methods use external knowledge as references for text generation to enhance factuality but often struggle with the knowledge mix-up(e. g., entity mismatch) of irrelevant references.

Text Generation

Paper
Add Code

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

no code implementations • 6 Sep 2023 • Chao Feng, Xinyu Zhang, Zichu Fei

In some previous works, additional modules like graph neural networks (GNNs) are trained on retrieved knowledge from external knowledge bases, aiming to mitigate the problem of lacking domain-specific knowledge.

Hallucination Knowledge Graphs +1

Paper
Add Code

DEFormer: DCT-driven Enhancement Transformer for Low-light Image and Dark Vision

no code implementations • 13 Sep 2023 • Xiangchen Yin, Zhenda Yu, Xin Gao, Ran Ju, Xiao Sun, Xinyu Zhang

However, it is difficult to restore the lost details in the dark area by relying only on the RGB domain.

Autonomous Driving Low-Light Image Enhancement

Paper
Add Code

Tackling the Non-IID Issue in Heterogeneous Federated Learning by Gradient Harmonization

no code implementations • 13 Sep 2023 • Xinyu Zhang, Weiyu Sun, Ying Chen

In this work, we revisit this key challenge through the lens of gradient conflicts on the server side.

Federated Learning Privacy Preserving

Paper
Add Code

Task Graph offloading via Deep Reinforcement Learning in Mobile Edge Computing

no code implementations • 19 Sep 2023 • Jiagang Liu, Yun Mi, Xinyu Zhang, Xiaocui Li

To adapt to environmental changes, we model the task graph scheduling for computation offloading as a Markov Decision Process (MDP).

Edge-computing reinforcement-learning +1

Paper
Add Code

CO-Net: Learning Multiple Point Cloud Tasks at Once with A Cohesive Network

no code implementations • ICCV 2023 • Tao Xie, Ke Wang, Siyi Lu, Yukun Zhang, Kun Dai, Xiaoyu Li, Jie Xu, Li Wang, Lijun Zhao, Xinyu Zhang, Ruifeng Li

Finally, we propose a sign-based gradient surgery to promote the training of CO-Net, thereby emphasizing the usage of task-shared parameters and guaranteeing that each task can be thoroughly optimized.

Incremental Learning Multi-Task Learning

Paper
Add Code

Sequential Texts Driven Cohesive Motions Synthesis with Natural Transitions

no code implementations • ICCV 2023 • Shuai Li, Sisi Zhuang, Wenfeng Song, Xinyu Zhang, Hejia Chen, Aimin Hao

At the technical level, we explore the local-to-global semantic features of previous and current texts to extract relevant information.

Paper
Add Code

BEVHeight++: Toward Robust Visual Centric 3D Object Detection

no code implementations • 28 Sep 2023 • Lei Yang, Tao Tang, Jun Li, Peng Chen, Kun Yuan, Li Wang, Yi Huang, Xinyu Zhang, Kaicheng Yu

In essence, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

3D Object Detection Autonomous Driving +2

Paper
Add Code

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations • 30 Sep 2023 • Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

Paper
Add Code

ProGO: Probabilistic Global Optimizer

no code implementations • 4 Oct 2023 • Xinyu Zhang, Sujit Ghosh

To address these challenges, we develop a sequence of multidimensional integration-based methods that we show to converge to the global optima under some mild regularity conditions.

Bayesian Optimization

Paper
Add Code

FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer

no code implementations • 20 Oct 2023 • Xinyu Zhang, Li Wang, Zhiqiang Jiang, Kun Dai, Tao Xie, Lei Yang, Wenhao Yu, Yang shen, Jun Li

However, these methods only integrate long-range context information among keypoints with a fixed receptive field, which constrains the network from reconciling the importance of features with different receptive fields to realize complete image perception, hence limiting the matching accuracy.

Homography Estimation Pose Estimation +1

Paper
Add Code

Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS

no code implementations • 21 Oct 2023 • Li Wang, Xinyu Zhang, Fachuan Zhao, Chuze Wu, Yichen Wang, Ziying Song, Lei Yang, Jun Li, Huaping Liu

The proposed Fuzzy-NMS module combines the volume and clustering density of candidate bounding boxes, refining them with a fuzzy classification method and optimizing the appropriate suppression thresholds to reduce uncertainty in the NMS process.

3D Object Detection object-detection

Paper
Add Code

Leveraging generative artificial intelligence to simulate student learning behavior

no code implementations • 30 Oct 2023 • Songlin Xu, Xinyu Zhang

Student simulation presents a transformative approach to enhance learning outcomes, advance educational research, and ultimately shape the future of effective pedagogy.

Paper
Add Code

Magmaw: Modality-Agnostic Adversarial Attacks on Machine Learning-Based Wireless Communication Systems

no code implementations • 1 Nov 2023 • Jung-Woo Chang, Ke Sun, Nasimeh Heydaribeni, Seira Hidano, Xinyu Zhang, Farinaz Koushanfar

Although there have been a number of adversarial attacks on ML-based wireless systems, the existing methods do not provide a comprehensive view including multi-modality of the source data, common physical layer components, and wireless domain constraints.

Paper
Add Code

Self-similarity Prior Distillation for Unsupervised Remote Physiological Measurement

no code implementations • 9 Nov 2023 • Xinyu Zhang, Weiyu Sun, Hao Lu, Ying Chen, Yun Ge, Xiaolin Huang, Jie Yuan, Yingcong Chen

In this paper, we propose a Self-Similarity Prior Distillation (SSPD) framework for unsupervised rPPG estimation, which capitalizes on the intrinsic self-similarity of cardiac activities.

Contrastive Learning

Paper
Add Code

Differentiable Radio Frequency Ray Tracing for Millimeter-Wave Sensing

no code implementations • 22 Nov 2023 • Xingyu Chen, Xinyu Zhang, Qiyue Xia, Xinmin Fang, Chris Xiaoxuan Lu, Zhengxiong Li

We propose DiffSBR, a differentiable framework for mmWave-based 3D reconstruction.

3D Reconstruction

Paper
Add Code

IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions

no code implementations • 30 Nov 2023 • Zhebin Zhang, Xinyu Zhang, Yuanhang Ren, Saijiang Shi, Meng Han, Yongkang Wu, Ruofei Lai, Zhao Cao

In this paper, we propose an Induction-Augmented Generation (IAG) framework that utilizes inductive knowledge along with the retrieved documents for implicit reasoning.

Knowledge Distillation Retrieval +1

Paper
Add Code

Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models

no code implementations • 5 Dec 2023 • Xinyu Zhang, Sebastian Hofstätter, Patrick Lewis, Raphael Tang, Jimmy Lin

However, current works in this direction all depend on the GPT models, making it a single point of failure in scientific reproducibility.

Passage Retrieval Retrieval

Paper
Add Code

Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

no code implementations • 29 Dec 2023 • Xin Gao, Tianheng Qiu, Xinyu Zhang, Hanlin Bai, Kang Liu, Xuan Huang, Hu Wei, Guoying Zhang, Huaping Liu

Coarse-to-fine schemes are widely used in traditional single-image motion deblur; however, in the context of deep learning, existing multi-scale algorithms not only require the use of complex modules for feature fusion of low-scale RGB images and deep semantics, but also manually generate low-resolution pairs of images that do not have sufficient confidence.

Computational Efficiency Deblurring

Paper
Add Code

From Data to Insights: A Comprehensive Survey on Advanced Applications in Thyroid Cancer Research

no code implementations • 8 Jan 2024 • Xinyu Zhang, Vincent CS Lee, Feng Liu

Thyroid cancer, the most prevalent endocrine cancer, has gained significant global attention due to its impact on public health.

Paper
Add Code

Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning

no code implementations • 4 Feb 2024 • Lanqing Li, Hai Zhang, Xinyu Zhang, Shatong Zhu, Junqiao Zhao, Pheng-Ann Heng

As a marriage between offline RL and meta-RL, the advent of offline meta-reinforcement learning (OMRL) has shown great promise in enabling RL agents to multi-task and quickly adapt while acquiring knowledge safely.

Meta Reinforcement Learning Offline RL

Paper
Add Code

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

no code implementations • 17 Feb 2024 • Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu

DORA incorporates an information bottleneck principle that maximizes mutual information between the dynamics encoding and the environmental data, while minimizing mutual information between the dynamics encoding and the actions of the behavior policy.

Representation Learning

Paper
Add Code

LDSF: Lightweight Dual-Stream Framework for SAR Target Recognition by Coupling Local Electromagnetic Scattering Features and Global Visual Features

no code implementations • 6 Mar 2024 • Xuying Xiong, Xinyu Zhang, Weidong Jiang, Tianpeng Liu

We extract the EM scattering (EMS) information from the complex SAR data and integrate the physical properties of the target into the network through a dual-stream framework to guide the network to learn physically meaningful and discriminative features.

Paper
Add Code

In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model

no code implementations • 10 Mar 2024 • Junhui Yin, Xinyu Zhang, Lin Wu, Xianghua Xie, Xiaojie Wang

To this end, we explore the concept of test-time prompt tuning (TTPT), which enables the adaptation of the CLIP model to novel downstream tasks through only one step of optimization on an unsupervised objective that involves the test sample.

In-Context Learning Language Modelling +1

Paper
Add Code

Stimulate the Potential of Robots via Competition

no code implementations • 15 Mar 2024 • Kangyao Huang, Di Guo, Xinyu Zhang, Xiangyang Ji, Huaping Liu

It is common for us to feel pressure in a competition environment, which arises from the desire to obtain success comparing with other individuals or opponents.

Paper
Add Code

An Analysis on Matching Mechanisms and Token Pruning for Late-interaction Models

no code implementations • 20 Mar 2024 • Qi Liu, Gang Guo, Jiaxin Mao, Zhicheng Dou, Ji-Rong Wen, Hao Jiang, Xinyu Zhang, Zhao Cao

Based on these findings, we then propose several simple document pruning methods to reduce the storage overhead and compare the effectiveness of different pruning methods on different late-interaction models.

Retrieval

Paper
Add Code

Worst-Case Riemannian Optimization with Uncertain Target Steering Vector for Slow-Time Transmit Sequence of Cognitive Radar

no code implementations • 16 Apr 2024 • Xinyu Zhang, Weidong Jiang, Xiangfeng Qiu, Yongxiang Liu

In order to solve this problem, we propose a new optimization method for slow-time transmit sequence design.

Riemannian optimization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.