Search Results for author: Tao Ma

Found 21 papers, 9 papers with code

Asphalt Concrete Characterization Using Digital Image Correlation: A Systematic Review of Best Practices, Applications, and Future Vision

no code implementations26 Feb 2024 Siqi Wang, Zehui Zhu, Tao Ma, Jianwei Fan

This article presents a state-of-art review of DIC as a crucial tool for laboratory testing of asphalt concrete (AC), primarily focusing on the widely utilized 2D-DIC and 3D-DIC techniques.

Real-Time Asphalt Pavement Layer Thickness Prediction Using Ground-Penetrating Radar Based on a Modified Extended Common Mid-Point (XCMP) Approach

no code implementations7 Jan 2024 Siqi Wang, Zhen Leng, Xin Sui, Weiguang Zhang, Tao Ma, Zehui Zhu

This study investigates the affecting factors and develops a modified XCMP method to allow automatic thickness prediction of in-service asphalt pavement with non-uniform dielectric properties through depth.

Dielectric Constant Edge Detection +1

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

1 code implementation9 Nov 2023 Licheng Wen, Xuemeng Yang, Daocheng Fu, XiaoFeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao

This has been a significant bottleneck, particularly in the development of common sense reasoning and nuanced scene understanding necessary for safe and reliable autonomous driving.

Autonomous Driving Common Sense Reasoning +4

DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models

2 code implementations28 Sep 2023 Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao

Recent advancements in autonomous driving have relied on data-driven approaches, which are widely adopted but face challenges including dataset bias, overfitting, and uninterpretability.

Autonomous Driving Common Sense Reasoning +1

AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer

no code implementations12 Sep 2023 Tao Ma, Chao Zhang, Min Lu, Lin Luo

Renal pathology, as the gold standard of kidney disease diagnosis, requires doctors to analyze a series of tissue slices stained by H&E staining and special staining like Masson, PASM, and PAS, respectively.

Graph Matching Style Transfer

Sequential Knockoffs for Variable Selection in Reinforcement Learning

no code implementations24 Mar 2023 Tao Ma, Hengrui Cai, Zhengling Qi, Chengchun Shi, Eric B. Laber

In real-world applications of reinforcement learning, it is often challenging to obtain a state representation that is parsimonious and satisfies the Markov property without prior knowledge.

reinforcement-learning Variable Selection

Pavementscapes: a large-scale hierarchical image dataset for asphalt pavement damage segmentation

1 code implementation24 Jul 2022 Zheng Tong, Tao Ma, Ju Huyan, Weiguang Zhang

However, few current public datasets limit the potential exploration of deep learning in the application of pavement damage segmentation.

Playing the Game of 2048 Segmentation

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

1 code implementation27 May 2022 Guohang Yan, Liu Zhuochun, Chengjie Wang, Chunlei Shi, Pengjin Wei, Xinyu Cai, Tao Ma, Zhizheng Liu, Zebin Zhong, Yuqian Liu, Ming Zhao, Zheng Ma, Yikang Li

To this end, we present OpenCalib, a calibration toolbox that contains a rich set of various sensor calibration methods.

Autonomous Driving

Comprehensive Review of Deep Learning-Based 3D Point Cloud Completion Processing and Analysis

no code implementations7 Mar 2022 Ben Fei, Weidong Yang, Wenming Chen, Zhijun Li, Yikang Li, Tao Ma, Xing Hu, Lipeng Ma

Point cloud completion is a generation and estimation issue derived from the partial point clouds, which plays a vital role in the applications in 3D computer vision.

Point Cloud Completion

MOC-GAN: Mixing Objects and Captions to Generate Realistic Images

no code implementations6 Jun 2021 Tao Ma, Yikang Li

Correspondingly, a MOC-GAN is proposed to mix the inputs of two modalities to generate realistic images.

Implicit Relations

Perception Entropy: A Metric for Multiple Sensors Configuration Evaluation and Design

no code implementations14 Apr 2021 Tao Ma, Zhizheng Liu, Yikang Li

To tackle these issues, we propose a novel method based on conditional entropy in Bayesian theory to evaluate the sensor configurations containing both cameras and LiDARs.

Autonomous Driving

CRLF: Automatic Calibration and Refinement based on Line Feature for LiDAR and Camera in Road Scenes

no code implementations8 Mar 2021 Tao Ma, Zhizheng Liu, Guohang Yan, Yikang Li

For autonomous vehicles, an accurate calibration for LiDAR and camera is a prerequisite for multi-sensor perception systems.

Autonomous Vehicles

Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging

no code implementations10 Jun 2020 Yeqi Bai, Tao Ma, Lipo Wang, Zhenjie Zhang

While deep learning technologies are now capable of generating realistic images confusing humans, the research efforts are turning to the synthesis of images for more concrete and application-specific purposes.

Image Generation

ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition

no code implementations21 May 2020 Jing Pan, Joshua Shapiro, Jeremy Wohlwend, Kyu J. Han, Tao Lei, Tao Ma

In this paper we present state-of-the-art (SOTA) performance on the LibriSpeech corpus with two novel neural network architectures, a multistream CNN for acoustic modeling and a self-attentive simple recurrent unit (SRU) for language modeling.

Data Augmentation Language Modelling +2

Multistream CNN for Robust Acoustic Modeling

no code implementations21 May 2020 Kyu J. Han, Jing Pan, Venkata Krishna Naveen Tadala, Tao Ma, Dan Povey

When combined with self-attentive SRU LM rescoring, multistream CNN contributes for ASAPP to achieve the best WER of 1. 75% on test-clean in LibriSpeech.

Data Augmentation speech-recognition +1

State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions

1 code implementation1 Oct 2019 Kyu J. Han, Ramon Prieto, Kaixing Wu, Tao Ma

Self-attention has been a huge success for many downstream tasks in NLP, which led to exploration of applying self-attention to speech problems as well.

speech-recognition Speech Recognition

PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph

1 code implementation NeurIPS 2019 Yikang Li, Tao Ma, Yeqi Bai, Nan Duan, Sining Wei, Xiaogang Wang

Therefore, to generate the images with preferred objects and rich interactions, we propose a semi-parametric method, PasteGAN, for generating the image from the scene graph and the image crops, where spatial arrangements of the objects and their pair-wise relationships are defined by the scene graph and the object appearances are determined by the given object crops.

Image Generation Object

Multi-task Learning for Financial Forecasting

no code implementations27 Sep 2018 Tao Ma

Compared to the previous works, we use multiple networks to forecast multiple related stocks, using the shared and private information of them simultaneously through multi-task learning.

Multi-Task Learning Time Series +1

Cannot find the paper you are looking for? You can Submit a new open access paper.