Search Results for author: Wenbo Zhao

Found 23 papers, 8 papers with code

MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields

no code implementations15 Oct 2024 Yuru Xiao, Deming Zhai, Wenbo Zhao, Kui Jiang, Junjun Jiang, Xianming Liu

These modular, plug-and-play strategies enhance robustness to sparse input views, accelerate rendering, and reduce memory consumption, making MCGS a practical and efficient framework for 3D Gaussian Splatting.

PVContext: Hybrid Context Model for Point Cloud Compression

no code implementations19 Sep 2024 Guoqing Zhang, Wenbo Zhao, Jian Liu, Yuanchao Bai, Junjun Jiang, Xianming Liu

Efficient storage of large-scale point cloud data has become increasingly challenging due to advancements in scanning technology.

Spatial Annealing for Efficient Few-shot Neural Rendering

1 code implementation12 Jun 2024 Yuru Xiao, Deming Zhai, Wenbo Zhao, Kui Jiang, Junjun Jiang, Xianming Liu

Although FreeNeRF has introduced an efficient frequency annealing strategy, its operation on frequency positional encoding is incompatible with the efficient hybrid representations.

Neural Rendering Novel View Synthesis

Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision

1 code implementation4 Jun 2024 Saierdaer Yusuyin, Te Ma, Hao Huang, Wenbo Zhao, Zhijian Ou

We construct a common experimental setup based on the CommonVoice dataset, called CV-Lang10, with 10 seen languages and 2 unseen languages.

Automatic Speech Recognition speech-recognition +1

DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge

no code implementations27 May 2024 Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Chunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu

However, given that most of the data in the autonomous driving dataset is collected in daytime scenarios, this leads to poor depth model performance in the face of out-of-distribution(OoD) data.

3D Reconstruction Autonomous Driving +1

Mesh Denoising Transformer

no code implementations10 May 2024 Wenbo Zhao, Xianming Liu, Deming Zhai, Junjun Jiang, Xiangyang Ji

Next, we propose a dual-stream structure consisting of a Geometric Encoder branch and a Spatial Encoder branch, which jointly encode local geometry details and spatial information to fully explore multimodal information for mesh denoising.

Denoising

REPS: Reconstruction-based Point Cloud Sampling

1 code implementation8 Mar 2024 Guoqing Zhang, Wenbo Zhao, Jian Liu, Xianming Liu

Our method outperforms previous approaches in preserving the structural features of the sampled point clouds.

Mitigating Bias for Question Answering Models by Tracking Bias Influence

no code implementations13 Oct 2023 Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, Nanyun Peng

Based on the intuition that a model would lean to be more biased if it learns from a biased example, we measure the bias level of a query instance by observing its influence on another instance.

Multiple-choice Multi-Task Learning +1

DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language Models

1 code implementation20 Jun 2023 Sidi Lu, Wenbo Zhao, Chenyang Tao, Arpit Gupta, Shanchan Wu, Tagyoung Chung, Nanyun Peng

NeurAlly-Decomposed Oracle (NADO) is a powerful approach for controllable generation with large language models.

Machine Translation

Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

1 code implementation22 May 2023 Hong Liu, Zhaobiao Lv, Zhijian Ou, Wenbo Zhao, Qing Xiao

Energy-based language models (ELMs) parameterize an unnormalized distribution for natural sentences and are radically different from popular autoregressive language models (ALMs).

Sentence speech-recognition +1

Unsupervised Melody-Guided Lyrics Generation

no code implementations12 May 2023 Yufei Tian, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Gunnar Sigurdsson, Chenyang Tao, Wenbo Zhao, Tagyoung Chung, Jing Huang, Nanyun Peng

At inference time, we leverage the crucial alignments between melody and lyrics and compile the given melody into constraints to guide the generation process.

Text Generation

A Learning-based Adaptive Compliance Method for Symmetric Bi-manual Manipulation

no code implementations27 Mar 2023 Yuxue Cao, Wenbo Zhao, Shengjie Wang, Xiang Zheng, Wenke Ma, Zhaolei Wang, Tao Zhang

However, traditional methods have viewed motion planning and compliant control as two separate modules, which can lead to conflicts with the simultaneous change of the desired trajectory and impedance parameters in the presence of external forces and disturbances.

Motion Planning

Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation

1 code implementation CVPR 2022 Wenbo Zhao, Xianming Liu, Zhiwei Zhong, Junjun Jiang, Wei Gao, Ge Li, Xiangyang Ji

Most existing methods either take the end-to-end supervised learning based manner, where large amounts of pairs of sparse input and dense ground-truth are exploited as supervision information; or treat up-scaling of different scale factors as independent tasks, and have to build multiple networks to handle upsampling with varying factors.

Self-Supervised Learning

Simple Question Answering with Subgraph Ranking and Joint-Scoring

no code implementations NAACL 2019 Wenbo Zhao, Tagyoung Chung, Anuj Goyal, Angeliki Metallinou

Using this framework as a starting point, we focus on two aspects: improving subgraph selection through a novel ranking method and leveraging the subject--relation dependency by proposing a joint scoring CNN model with a novel loss function that enforces the well-order of scores.

Fact Selection Question Answering +1

Hierarchical Routing Mixture of Experts

no code implementations18 Mar 2019 Wenbo Zhao, Yang Gao, Shahan Ali Memon, Bhiksha Raj, Rita Singh

Addressing these problems, we propose a binary tree-structured hierarchical routing mixture of experts (HRME) model that has classifiers as non-leaf node experts and simple regression models as leaf node experts.

regression

Neural Regression Trees

no code implementations1 Oct 2018 Shahan Ali Memon, Wenbo Zhao, Bhiksha Raj, Rita Singh

Regression-via-Classification (RvC) is the process of converting a regression problem to a classification one.

Classification General Classification +1

Neural Regression Tree

no code implementations27 Sep 2018 Wenbo Zhao, Shahan Ali Memon, Bhiksha Raj, Rita Singh

Regression-via-Classification (RvC) is the process of converting a regression problem to a classification one.

Classification regression

Speaker identification from the sound of the human breath

no code implementations1 Dec 2017 Wenbo Zhao, Yang Gao, Rita Singh

The goal of this paper is to demonstrate that breath sounds are indeed bio-signatures that can be used to identify speakers.

Speaker Identification Speaker Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.