Search Results for author: Yao Yao

Found 48 papers, 19 papers with code

Anti-Aliased Neural Implicit Surfaces with Encoding Level of Detail

no code implementations 19 Sep 2023 Yiyu Zhuang, Qi Zhang, Ying Feng, Hao Zhu, Yao Yao, Xiaoyu Li, Yan-Pei Cao, Ying Shan, Xun Cao

Drawing inspiration from voxel-based representations with the level of detail (LoD), we introduce a multi-scale tri-plane-based scene representation that is capable of capturing the LoD of the signed distance function (SDF) and the space radiance.

Surface Reconstruction

TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

2 code implementations 22 Aug 2023 Xiaoyan Cao, Yiyao Zheng, Yao Yao, Huapeng Qin, Xiaoyu Cao, Shihui Guo

Existing trackers can be categorized into two association paradigms: single-feature paradigm (based on either motion or appearance feature) and serial paradigm (one feature serves as secondary while the other is primary).

Multi-Object Tracking

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

no code implementations 9 Aug 2023 Peike Li, BoYu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang

Despite the task's significance, prevailing generative models exhibit limitations in music quality, computational efficiency, and generalization.

Music Generation Text-to-Music Generation

AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation

no code implementations 16 Jun 2023 Yifei Zeng, Yuanxun Lu, Xinya Ji, Yao Yao, Hao Zhu, Xun Cao

Unlike previous approaches that can only synthesize avatars based on simple text descriptions, our method enables the creation of personalized avatars from casually captured face or body images, while still supporting text-based model generation and editing.

Text to 3D

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models

no code implementations 26 May 2023 Yao Yao, Zuchao Li, Hai Zhao

Therefore, we propose Graph-of-Thought (GoT) reasoning, which models human thought processes not only as a chain but also as a graph.

GSM8K Representation Learning

NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation

no code implementations 30 Mar 2023 Jingyang Zhang, Yao Yao, Shiwei Li, Jingbo Liu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a novel differentiable rendering framework for joint geometry, material, and lighting estimation from multi-view images.

Lighting Estimation

Stochastic Methods for AUC Optimization subject to AUC-based Fairness Constraints

no code implementations 23 Dec 2022 Yao Yao, Qihang Lin, Tianbao Yang

In this work, we formulate the training problem of a fairness-aware machine learning model as an AUC optimization problem subject to a class of AUC-based fairness constraints.


Towards Relation-centered Pooling and Convolution for Heterogeneous Graph Learning Networks

1 code implementation 31 Oct 2022 Tiehua Zhang, Yuze Liu, Yao Yao, Youhua Xia, Xin Chen, Xiaowei Huang, Jiong Jin

Heterogeneous graph neural networks have unleashed great potential in graph representation learning and shown superior performance on downstream tasks such as node classification and clustering.

Graph Learning Graph Representation Learning +1

Critical Regularizations for Neural Surface Reconstruction in the Wild

no code implementations CVPR 2022 Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

The first one is the Hessian regularization that smoothly diffuses the signed distance values to the entire distance field given noisy and incomplete input.

Surface Reconstruction

i-Razor: A Neural Input Razor for Feature Selection and Dimension Search in Large-Scale Recommender Systems

1 code implementation 1 Apr 2022 Yao Yao, Bin Liu, Haoxun He, Dakui Sheng, Ke Wang, Li Xiao, Huanhuan Cao

Typically, feature selection and embedding dimension search are optimized sequentially, i.e., feature selection is performed first, followed by embedding dimension search to determine the optimal dimension size for each selected feature.

Click-Through Rate Prediction Feature Engineering +3

NeILF: Neural Incident Light Field for Physically-based Material Estimation

1 code implementation 14 Mar 2022 Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

We present a differentiable rendering framework for material and lighting estimation from multi-view images and a reconstructed geometry.

Lighting Estimation

Large-scale Optimization of Partial AUC in a Range of False Positive Rates

no code implementations 3 Mar 2022 Yao Yao, Qihang Lin, Tianbao Yang

The partial AUC, as a generalization of the AUC, summarizes only the TPRs over a specific range of the FPRs and is thus a more suitable performance measure in many real-world situations.

CLS: Cross Labeling Supervision for Semi-Supervised Learning

1 code implementation 17 Feb 2022 Yao Yao, Junyi Shen, Jin Xu, Bin Zhong, Li Xiao

Based on FixMatch, where a pseudo label is generated from a weakly-augmented sample to teach the prediction on a strong augmentation of the same input sample, CLS allows the creation of both pseudo and complementary labels to support both positive and negative learning.

Pseudo Label

POI-Transformers: POI Entity Matching through POI Embeddings by Incorporating Semantic and Geographic Information

no code implementations 29 Sep 2021 Jinbao Zhang, Changwang Zhang, Xiaojuan Liu, Xia Li, Weilin Liao, Penghua Liu, Yao Yao, Jihong Zhang

A general and robust POI embedding framework, the POI-Transformers, is initially proposed in this study to address these problems of POI entity matching.

Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control

no code implementations 26 Aug 2021 Wanpeng Zhang, Xiaoyan Cao, Yao Yao, Zhicheng An, Xi Xiao, Dijun Luo

In this paper, we present a model-based robust RL framework for autonomous greenhouse control to meet the sample efficiency and safety challenges.

Decision Making Model-based Reinforcement Learning +2

Learning Signed Distance Field for Multi-view Surface Reconstruction

1 code implementation ICCV 2021 Jingyang Zhang, Yao Yao, Long Quan

In this work, we introduce a novel neural surface reconstruction framework that leverages the knowledge of stereo matching and feature consistency to optimize the implicit surface representation.

Stereo Matching Surface Reconstruction

MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning

no code implementations 3 Aug 2021 Wanpeng Zhang, Xi Xiao, Yao Yao, Mingzhe Chen, Dijun Luo

MBDP consists of two kinds of dropout mechanisms, where the rollout-dropout aims to improve the robustness with a small cost of sample efficiency, while the model-dropout is designed to compensate for the lost efficiency at a slight expense of robustness.

Model-based Reinforcement Learning

IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control

1 code implementation 6 Jul 2021 Xiaoyan Cao, Yao Yao, Lanqing Li, Wanpeng Zhang, Zhicheng An, Zhong Zhang, Li Xiao, Shihui Guo, Xiaoyu Cao, Meihong Wu, Dijun Luo

However, the optimal control of autonomous greenhouses is challenging, requiring decision-making based on high-dimensional sensory data, and the scaling of production is limited by the scarcity of labor capable of handling this task.

Cloud Computing Decision Making

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation

1 code implementation 5 Jul 2021 Yao Yao, Li Xiao, Zhicheng An, Wanpeng Zhang, Dijun Luo

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics.

Continuous Control reinforcement-learning +1

Excess-noise suppression for a squeezed state propagating through random amplifying media via wave-front shaping

no code implementations 4 Feb 2021 Dong Li, Song Sun, Yao Yao

After propagating through a random amplifying medium, a squeezed state commonly shows excess noise above the shot-noise level.

Quantum Physics

Remarks on stationary and uniformly-rotating vortex sheets: Rigidity results

no code implementations 8 Dec 2020 Javier Gómez-Serrano, Jaemin Park, Jia Shi, Yao Yao

In this paper, we show that the only solution of the vortex sheet equation, either stationary or uniformly rotating with negative angular velocity $\Omega$, such that it has positive vorticity and is concentrated in a finite disjoint union of smooth curves with finite length is the trivial one: constant vorticity amplitude supported on a union of nested, concentric circles.

Analysis of PDEs

Understanding the drivers of sustainable land expansion using a patch-level simulation model: A case study in Wuhan, China

no code implementations 22 Oct 2020 Xun Liang, Qingfeng Guan, Keith C. Clarke, Shishi Liu, Bingyu Wang, Yao Yao

The change complexity lies in the detailed scale of high granularity data, and in the geometric units used to simulate the change.

Computers and Society

Practical Option Valuations of Futures Contracts with Negative Underlying Prices

no code implementations 25 Sep 2020 Anatoliy Swishchuk, Ana Roldan-Contreras, Elham Soufiani, Guillermo Martinez, Mohsen Seifi, Nishant Agrawal, Yao Yao

Here we propose two alternatives to Black 76 to value European option future contracts in which the underlying market prices can be negative or mean reverting.

Visibility-aware Multi-view Stereo Network

1 code implementation 18 Aug 2020 Jingyang Zhang, Yao Yao, Shiwei Li, Zixin Luo, Tian Fang

As such, the adverse influence of occluded pixels is suppressed in the cost fusion.

3D Reconstruction Depth Estimation +1

Learning Stereo Matchability in Disparity Regression Networks

1 code implementation 11 Aug 2020 Jingyang Zhang, Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

Finally, a matchability-aware disparity refinement is introduced to improve the depth inference in weakly matchable regions.

regression Stereo Disparity Estimation +1

KFNet: Learning Temporal Camera Relocalization using Kalman Filtering

1 code implementation CVPR 2020 Lei Zhou, Zixin Luo, Tianwei Shen, Jiahui Zhang, Mingmin Zhen, Yao Yao, Tian Fang, Long Quan

Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization which focuses on a still image.

Camera Relocalization

Network Cooperation with Progressive Disambiguation for Partial Label Learning

no code implementations 22 Feb 2020 Yao Yao, Chen Gong, Jiehui Deng, Jian Yang

Partial Label Learning (PLL) aims to train a classifier when each training instance is associated with a set of candidate labels, among which only one is correct but is not accessible during the training phase.

Partial Label Learning

BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks

2 code implementations CVPR 2020 Yao Yao, Zixin Luo, Shiwei Li, Jingyang Zhang, Yufan Ren, Lei Zhou, Tian Fang, Long Quan

Compared with other computer vision tasks, it is rather difficult to collect a large-scale MVS dataset, as it requires expensive active scanners and a labor-intensive process to obtain ground truth 3D structures.

3D Reconstruction

Self-Supervised Learning of Depth and Motion Under Photometric Inconsistency

1 code implementation 19 Sep 2019 Tianwei Shen, Lei Zhou, Zixin Luo, Yao Yao, Shiwei Li, Jiahui Zhang, Tian Fang, Long Quan

The self-supervised learning of depth and pose from monocular sequences provides an attractive solution by using the photometric consistency of nearby frames as it depends much less on the ground-truth data.

Pose Estimation Self-Supervised Learning

ContextDesc: Local Descriptor Augmentation with Cross-Modality Context

1 code implementation CVPR 2019 Zixin Luo, Tianwei Shen, Lei Zhou, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

Most existing studies on learning local features focus on the patch-based descriptions of individual keypoints while neglecting the spatial relations established from their keypoint locations.

Geometric Matching

Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

1 code implementation CVPR 2019 Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

However, one major limitation of current learned MVS approaches is scalability: the memory-consuming cost volume regularization makes learned MVS hard to apply to high-resolution scenes.

Vocal Bursts Intensity Prediction

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

1 code implementation ECCV 2018 Zixin Luo, Tianwei Shen, Lei Zhou, Siyu Zhu, Runze Zhang, Yao Yao, Tian Fang, Long Quan

Learned local descriptors based on Convolutional Neural Networks (CNNs) have achieved significant improvements on patch-based benchmarks, but have not demonstrated strong generalization ability on recent benchmarks of image-based 3D reconstruction.

3D Reconstruction

MVSNet: Depth Inference for Unstructured Multi-view Stereo

4 code implementations ECCV 2018 Yao Yao, Zixin Luo, Shiwei Li, Tian Fang, Long Quan

We present an end-to-end deep learning architecture for depth map inference from multi-view images.

Ranked #14 on Point Clouds on Tanks and Temples (Mean F1 (Intermediate) metric)

3D Reconstruction Point Clouds

Pulsar Candidate Identification with Artificial Intelligence Techniques

no code implementations 27 Nov 2017 Ping Guo, Fuqing Duan, Pei Wang, Yao Yao, Qian Yin, Xin Xin

To address these problems, we proposed a framework that combines a deep convolution generative adversarial network (DCGAN) with a support vector machine (SVM) to deal with the class imbalance problem and to improve pulsar identification accuracy.


Sensing Urban Land-Use Patterns By Integrating Google Tensorflow And Scene-Classification Models

no code implementations 4 Aug 2017 Yao Yao, Haolin Liang, Xia Li, Jinbao Zhang, Jialv He

To take advantage of the deep-learning method in detecting urban land-use patterns, we applied a transfer-learning-based remote-sensing image approach to extract and classify features.

General Classification Scene Classification +1

Extracting urban impervious surface from GF-1 imagery using one-class classifiers

no code implementations 13 May 2017 Yao Yao, Jialv He, Jinbao Zhang, Yatao Zhang

In this study, we investigate several one-class classifiers, such as Presence and Background Learning (PBL), Positive Unlabeled Learning (PUL), OCSVM, BSVM and MAXENT, to extract urban impervious surface area using high spatial resolution imagery of GF-1, China's new generation of high spatial remote sensing satellite, and evaluate the classification accuracy based on artificial interpretation results.

General Classification Management
