Search Results for author: Xin Tong

Found 87 papers, 42 papers with code

Feature Augmentation via Nonparametrics and Selection (FANS) in High Dimensional Classification

no code implementations • 31 Dec 2013 • Jianqing Fan, Yang Feng, Jiancheng Jiang, Xin Tong

We motivate FANS by generalizing the Naive Bayes model, writing the log ratio of joint densities as a linear combination of those of marginal densities.

Additive models General Classification +1
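
As a hedged illustration of the generalization described above: under Naive Bayes the class-conditional joint densities factor into marginals, so the log ratio is a plain sum of marginal log ratios, and a linear combination replaces the unit weights with fitted coefficients. The symbols $f$, $g$, $f_j$, $g_j$, $\beta_j$ and $p$ are notation assumed here for illustration and do not appear in the listing.

```latex
% Naive Bayes: features are independent within each class
\log\frac{f(x)}{g(x)} = \sum_{j=1}^{p} \log\frac{f_j(x_j)}{g_j(x_j)}

% Linear-combination generalization: coefficients \beta_j replace the unit weights
\log\frac{f(x)}{g(x)} \approx \beta_0 + \sum_{j=1}^{p} \beta_j \log\frac{f_j(x_j)}{g_j(x_j)}
```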

Neyman-Pearson Classification under High-Dimensional Settings

no code implementations • 13 Aug 2015 • Anqi Zhao, Yang Feng, Lie Wang, Xin Tong

Most existing binary classification methods target the optimization of the overall classification risk and may fail to serve some real-world applications such as cancer diagnosis, where users are more concerned with the risk of misclassifying one specific class than the other.

Binary Classification Classification +4

Mesh Denoising via Cascaded Normal Regression

no code implementations • 15 Nov 2016 • Peng-Shuai Wang, Yang Liu, Xin Tong

At runtime, our method applies the learned cascaded regression functions to a noisy input mesh and reconstructs the denoised mesh from the output facet normals.

Denoising regression

Demography-based Facial Retouching Detection using Subclass Supervised Sparse Autoencoder

no code implementations • 22 Sep 2017 • Aparna Bharati, Mayank Vatsa, Richa Singh, Kevin W. Bowyer, Xin Tong

However, previous work on this topic has not considered whether or how accuracy of retouching detection varies with the demography of face images.

O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

1 code implementation • 5 Dec 2017 • Peng-Shuai Wang, Yang Liu, Yu-Xiao Guo, Chun-Yu Sun, Xin Tong

We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis.

Ranked #4 on 3D Object Classification on ModelNet40

3D Object Classification Retrieval +1

Intentional Control of Type I Error over Unconscious Data Distortion: a Neyman-Pearson Approach to Text Classification

no code implementations • 7 Feb 2018 • Lucy Xia, Richard Zhao, Yanhui Wu, Xin Tong

To deal with inestimable data distortion, we propose the use of the Neyman-Pearson (NP) classification paradigm, which minimizes type II error under a user-specified type I error constraint.

General Classification text-classification +1
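
A minimal sketch of how a user-specified type I error constraint can be enforced by thresholding, assuming a scoring function has already been trained and a held-out set of i.i.d. class-0 scores without ties is available. This is a generic order-statistic rule, not necessarily the procedure used in the paper above; the function names and the tolerance `delta` are hypothetical.

```python
from math import comb


def violation_bound(n: int, k: int, alpha: float) -> float:
    """Upper bound on P(type I error > alpha) when the threshold is the
    k-th smallest of n held-out class-0 scores (k is 1-indexed)."""
    return sum(comb(n, j) * (1 - alpha) ** j * alpha ** (n - j)
               for j in range(k, n + 1))


def np_order(n: int, alpha: float, delta: float):
    """Smallest order k whose violation bound is at most delta, or None
    if even the largest score (k = n) is not conservative enough."""
    for k in range(1, n + 1):
        if violation_bound(n, k, alpha) <= delta:
            return k
    return None


def np_threshold(class0_scores, alpha: float, delta: float):
    """Score threshold; predict class 1 when a score exceeds it."""
    k = np_order(len(class0_scores), alpha, delta)
    if k is None:
        raise ValueError("too few class-0 scores for this alpha and delta")
    return sorted(class0_scores)[k - 1]
```

With `alpha = delta = 0.05`, this rule needs at least 59 held-out class-0 scores, because 0.95^n first drops below 0.05 at n = 59. Picking the smallest valid order keeps the threshold as low as the constraint allows, which limits the type II error.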

Neyman-Pearson classification: parametrics and sample size requirement

no code implementations • 7 Feb 2018 • Xin Tong, Lucy Xia, Jiacheng Wang, Yang Feng

In this work, we employ the parametric linear discriminant analysis (LDA) model and propose a new parametric thresholding algorithm, which does not need the minimum sample size requirements on class $0$ observations and thus is suitable for small sample applications such as rare disease diagnosis.

Binary Classification Classification +3

View-volume Network for Semantic Scene Completion from a Single Depth Image

no code implementations • 14 Jun 2018 • Yu-Xiao Guo, Xin Tong

We introduce a View-Volume convolutional neural network (VVNet) for inferring the occupancy and semantic labels of a volumetric 3D scene from a single depth image.

Ranked #20 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion

PFCNN: Convolutional Neural Networks on 3D Surfaces Using Parallel Frames

1 code implementation • CVPR 2020 • Yu-Qi Yang, Shilin Liu, Hao Pan, Yang Liu, Xin Tong

Surface meshes are widely used shape representations and capture finer geometry data than point clouds or volumetric grids, but applying CNNs to them directly is challenging due to their non-Euclidean structure.

Ranked #27 on Semantic Segmentation on ScanNet

Scene Segmentation Segmentation

Deep Single-View 3D Object Reconstruction with Visual Hull Embedding

1 code implementation • 10 Sep 2018 • Hanqing Wang, Jiaolong Yang, Wei Liang, Xin Tong

The key idea of our method is to leverage object mask and pose estimation from CNNs to assist the 3D shape learning by constructing a probabilistic single-view visual hull inside of the network.

3D Object Reconstruction Object +1

Adaptive O-CNN: A Patch-based Deep Representation of 3D Shapes

1 code implementation • 21 Sep 2018 • Peng-Shuai Wang, Chun-Yu Sun, Yang Liu, Xin Tong

The Adaptive O-CNN encoder takes the planar patch normal and displacement as input and performs 3D convolutions only at the octants at each level, while the Adaptive O-CNN decoder infers the shape occupancy and subdivision status of octants at each level and estimates the best plane normal and displacement for each leaf octant.

Image Smoothing via Unsupervised Learning

1 code implementation • 7 Nov 2018 • Qingnan Fan, Jiaolong Yang, David Wipf, Baoquan Chen, Xin Tong

Image smoothing represents a fundamental component of many disparate computer vision and graphics applications.

Image Manipulation image smoothing

Learn a Prior for RHEA for Better Online Planning

no code implementations • 14 Feb 2019 • Xin Tong, Weiming Liu, Bin Li

In this paper, we propose to learn a prior for RHEA in an offline manner by training a value network and a policy network.

Evolutionary Algorithms OpenAI Gym

A Skeleton-bridged Deep Learning Approach for Generating Meshes of Complex Topologies from Single RGB Images

1 code implementation • CVPR 2019 • Jiapeng Tang, Xiaoguang Han, Junyi Pan, Kui Jia, Xin Tong

To this end, we propose in this paper a skeleton-bridged, stage-wise learning approach to address the challenge.

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set

4 code implementations • 20 Mar 2019 • Yu Deng, Jiaolong Yang, Sicheng Xu, Dong Chen, Yunde Jia, Xin Tong

Recently, deep learning based 3D face reconstruction methods have shown promising results in both quality and efficiency. However, training deep neural networks typically requires a large volume of data, whereas face images with ground-truth 3D face shapes are scarce.

Ranked #3 on 3D Face Reconstruction on Florence (RMSE Cooperative metric)

3D Face Reconstruction Weakly-supervised Learning

Synthesizing 3D Shapes from Silhouette Image Collections using Multi-projection Generative Adversarial Networks

no code implementations • CVPR 2019 • Xiao Li, Yue Dong, Pieter Peers, Xin Tong

Key to our method is a novel multi-projection generative adversarial network (MP-GAN) that trains a 3D shape generator to be consistent with multiple 2D projections of the 3D shapes, and without direct access to these 3D shapes.

Generative Adversarial Network Weakly-supervised Learning

Imbalanced classification: a paradigm-based review

no code implementations • 11 Feb 2020 • Yang Feng, Min Zhou, Xin Tong

For each pair of resampling techniques and classification methods, we use simulation studies and a real data set on credit card fraud to study the performance under different evaluation metrics.

Binary Classification Classification +2

Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning

4 code implementations • CVPR 2020 • Yu Deng, Jiaolong Yang, Dong Chen, Fang Wen, Xin Tong

Our method can also be used to embed real images into the disentangled latent space.

Contrastive Learning Disentanglement +1

Deep 3D Portrait from a Single Image

1 code implementation • CVPR 2020 • Sicheng Xu, Jiaolong Yang, Dong Chen, Fang Wen, Yu Deng, Yunde Jia, Xin Tong

We evaluate the accuracy of our method both in 3D and with pose manipulation tasks on 2D images.

Face Model Stereo Matching

Deep Octree-based CNNs with Output-Guided Skip Connections for 3D Shape and Scene Completion

1 code implementation • 6 Jun 2020 • Peng-Shuai Wang, Yang Liu, Xin Tong

Acquiring complete and clean 3D shape and scene data is challenging due to geometric occlusion and insufficient views during 3D capturing.

A Closer Look at Local Aggregation Operators in Point Cloud Analysis

1 code implementation • ECCV 2020 • Ze Liu, Han Hu, Yue Cao, Zheng Zhang, Xin Tong

Our investigation reveals that despite the different designs of these operators, all of these operators make surprisingly similar contributions to the network performance under the same network input and feature numbers and result in the state-of-the-art accuracy on standard benchmarks.

Ranked #4 on 3D Semantic Segmentation on PartNet

3D Semantic Segmentation

Unsupervised 3D Learning for Shape Analysis via Multiresolution Instance Discrimination

1 code implementation • 3 Aug 2020 • Peng-Shuai Wang, Yu-Qi Yang, Qian-Fang Zou, Zhirong Wu, Yang Liu, Xin Tong

Although unsupervised feature learning has demonstrated its advantages in reducing the workload of data labeling and network design in many fields, existing unsupervised 3D learning methods still cannot offer a generic network for various shape analysis tasks with competitive performance to supervised methods.

Ranked #2 on 3D Semantic Segmentation on PartNet

3D Point Cloud Linear Classification 3D Semantic Segmentation

Object-based Illumination Estimation with Rendering-aware Neural Networks

no code implementations • ECCV 2020 • Xin Wei, Guojun Chen, Yue Dong, Stephen Lin, Xin Tong

With the estimated lighting, virtual objects can be rendered in AR scenarios with shading that is consistent with the real scene, leading to improved realism.

Inverse Rendering Object

SkeletonNet: A Topology-Preserving Solution for Learning Mesh Reconstruction of Object Surfaces from RGB Images

1 code implementation • 13 Aug 2020 • Jiapeng Tang, Xiaoguang Han, Mingkui Tan, Xin Tong, Kui Jia

However, they all have their own drawbacks, and cannot properly reconstruct the surface shapes of complex topologies, arguably due to a lack of constraints on the topological structures in their learning frameworks.

Surface Reconstruction

Field-Tuned Quantum Effects in a Triangular-Lattice Ising Magnet

no code implementations • 18 Nov 2020 • Yayuan Qin, Yao Shen, ChangLe Liu, Hongliang Wo, Yonghao Gao, Yu Feng, Xiaowen Zhang, Gaofeng Ding, Yiqing Gu, Qisi Wang, Shoudong Shen, Helen C. Walker, Robert Bewley, Jianhui Xu, Martin Boehm, Paul Steffens, Seiko Ohira-Kawamura, Naoki Murai, Astrid Schneidewind, Xin Tong, Gang Chen, Jun Zhao

We report thermodynamic and neutron scattering measurements of the triangular-lattice quantum Ising magnet TmMgGaO$_4$ in longitudinal magnetic fields.

Strongly Correlated Electrons Materials Science

Deformed Implicit Field: Modeling 3D Shapes with Learned Dense Correspondence

1 code implementation • CVPR 2021 • Yu Deng, Jiaolong Yang, Xin Tong

We propose a novel Deformed Implicit Field (DIF) representation for modeling 3D shapes of a category and generating dense correspondences among shapes.

Bridging Cost-sensitive and Neyman-Pearson Paradigms for Asymmetric Binary Classification

1 code implementation • 29 Dec 2020 • Wei Vivian Li, Xin Tong, Jingyi Jessica Li

In contrast, the Neyman-Pearson paradigm can train classifiers to achieve a high-probability control of the population type I error, but it relies on sample splitting that reduces the effective training sample size.

Binary Classification General Classification +2

Learning High-Fidelity Face Texture Completion Without Complete Face Texture

no code implementations • ICCV 2021 • Jongyoo Kim, Jiaolong Yang, Xin Tong

For face texture completion, previous methods typically use some complete textures captured by multiview imaging systems or 3D scanners for supervised learning.

Vocal Bursts Intensity Prediction

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction

1 code implementation • CVPR 2021 • Shi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu

We incorporate IMLS surface generation into deep neural networks for inheriting both the flexibility of point sets and the high quality of implicit surfaces.

3D Object Reconstruction 3D Reconstruction +1

Group-Free 3D Object Detection via Transformers

4 code implementations • ICCV 2021 • Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong

Instead of grouping local points to each object candidate, our method computes the feature of an object from all the points in the point cloud with the help of an attention mechanism in Transformers, where the contribution of each point is automatically learned in the network training.

Ranked #3 on 3D Object Detection on SUN-RGBD

3D Object Detection Object +1

High-Resolution Optical Flow from 1D Attention and Correlation

1 code implementation • ICCV 2021 • Haofei Xu, Jiaolong Yang, Jianfei Cai, Juyong Zhang, Xin Tong

Optical flow is inherently a 2D search problem, and thus the computational complexity grows quadratically with respect to the search window, making large-displacement matching infeasible for high-resolution images.

4k Optical Flow Estimation +1
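
To make the quadratic growth concrete, here is a small sketch that counts matching candidates per pixel. The 2D count follows from the statement above. The 1D count assumes that the search is split into one vertical and one horizontal pass, which the title suggests but the listing does not spell out. The function names and the `radius` parameter are illustrative.

```python
def candidates_2d(radius: int) -> int:
    """Candidates per pixel in a square window of the given radius."""
    side = 2 * radius + 1
    return side * side


def candidates_1d_pair(radius: int) -> int:
    """Candidates per pixel for one vertical plus one horizontal 1D search."""
    return 2 * (2 * radius + 1)


if __name__ == "__main__":
    # Print how both counts grow as the window radius doubles.
    for radius in (4, 8, 16):
        print(radius, candidates_2d(radius), candidates_1d_pair(radius))
```

Doubling the radius roughly quadruples the 2D count but only doubles the 1D count, which is why large windows become expensive at high resolution.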

Profiling Pareto Front With Multi-Objective Stein Variational Gradient Descent

1 code implementation • NeurIPS 2021 • Xingchao Liu, Xin Tong, Qiang Liu

Finding diverse and representative Pareto solutions from the Pareto front is a key challenge in multi-objective optimization (MOO).

Sampling with Trustworthy Constraints: A Variational Gradient Framework

1 code implementation • NeurIPS 2021 • Xingchao Liu, Xin Tong, Qiang Liu

In this work, we propose a family of constrained sampling algorithms which generalize Langevin Dynamics (LD) and Stein Variational Gradient Descent (SVGD) to incorporate a moment constraint specified by a general nonlinear function.

Bayesian Inference Fairness

Spline Positional Encoding for Learning 3D Implicit Signed Distance Fields

1 code implementation • 3 Jun 2021 • Peng-Shuai Wang, Yang Liu, Yu-Qi Yang, Xin Tong

Multilayer perceptrons (MLPs) have been successfully used to represent 3D shapes implicitly and compactly, by mapping 3D coordinates to the corresponding signed distance values or occupancy values.

3D Shape Reconstruction Image Reconstruction

StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation

1 code implementation • 9 Jul 2021 • Wonjong Jang, Gwangjin Ju, Yucheol Jung, Jiaolong Yang, Xin Tong, Seungyong Lee

Our framework, dubbed StyleCariGAN, automatically creates a realistic and detailed caricature from an input photo with optional controls on shape exaggeration degree and color stylization type.

Caricature

Indoor Scene Generation from a Collection of Semantic-Segmented Depth Images

1 code implementation • ICCV 2021 • Ming-Jia Yang, Yu-Xiao Guo, Bin Zhou, Xin Tong

Different from existing methods that represent an indoor scene with the type, location, and other properties of objects in the room and learn the scene layout from a collection of complete 3D indoor scenes, our method models each indoor scene as a 3D semantic scene volume and learns a volumetric generative adversarial network (GAN) from a collection of 2.5D partial observations of 3D scenes.

Generative Adversarial Network Scene Generation

Joint Multi-User Communication and Sensing Exploiting Both Signal and Environment Sparsity

no code implementations • 6 Sep 2021 • Xin Tong, Zhaoyang Zhang, Jue Wang, Chongwen Huang, Merouane Debbah

As a potential technology feature for 6G wireless networks, the idea of sensing-communication integration requires the system not only to complete reliable multi-user communication but also to achieve accurate environment sensing.

object-detection Object Detection

Asymmetric error control under imperfect supervision: a label-noise-adjusted Neyman-Pearson umbrella algorithm

no code implementations • 1 Dec 2021 • Shunan Yao, Bradley Rava, Xin Tong, Gareth James

It is somewhat surprising that even when common NP classifiers ignore the label noise in the training stage, they are still able to control the type I error with high probability.

Classification Medical Diagnosis +1

VirtualCube: An Immersive 3D Video Communication System

no code implementations • 13 Dec 2021 • Yizhong Zhang, Jiaolong Yang, Zhen Liu, Ruicheng Wang, Guojun Chen, Xin Tong, Baining Guo

The VirtualCube system is a 3D video conference system that attempts to overcome some limitations of conventional technologies.

Depth Estimation

GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation

no code implementations • CVPR 2022 • Yu Deng, Jiaolong Yang, Jianfeng Xiang, Xin Tong

3D-aware image generative modeling aims to generate 3D-consistent images with explicitly controllable camera poses.

3D-Aware Image Synthesis Face Generation +1

Transformer Based Line Segment Classifier With Image Context for Real-Time Vanishing Point Detection in Manhattan World

no code implementations • CVPR 2022 • Xin Tong, Xianghua Ying, Yongjie Shi, Ruibin Wang, Jinfa Yang

To achieve this goal, we propose a novel Transformer based Line segment Classifier (TLC) that can group line segments in images and estimate the corresponding vanishing points.

MPS-NeRF: Generalizable 3D Human Rendering from Multiview Images

no code implementations • 31 Mar 2022 • Xiangjun Gao, Jiaolong Yang, Jongyoo Kim, Sida Peng, Zicheng Liu, Xin Tong

For this task, we propose a simple yet effective method to train a generalizable NeRF with multiview images as conditional input.

Novel View Synthesis

Semi-supervised 3D shape segmentation with multilevel consistency and part substitution

1 code implementation • 19 Apr 2022 • Chun-Yu Sun, Yu-Qi Yang, Hao-Xiang Guo, Peng-Shuai Wang, Xin Tong, Yang Liu, Heung-Yeung Shum

We propose an effective semi-supervised method for learning 3D segmentations from a few labeled 3D shapes and a large amount of unlabeled 3D data.

Segmentation Semantic Segmentation +2

Dual Octree Graph Networks for Learning Adaptive Volumetric Shape Representations

1 code implementation • 5 May 2022 • Peng-Shuai Wang, Yang Liu, Xin Tong

Our method encodes the volumetric field of a 3D shape with an adaptive feature volume organized by an octree and applies a compact multilayer perceptron network for mapping the features to the field value at each 3D position.

3D Shape Reconstruction

What are People Talking about in #BlackLivesMatter and #StopAsianHate? Exploring and Categorizing Twitter Topics Emerging in Online Social Movements through the Latent Dirichlet Allocation Model

1 code implementation • 29 May 2022 • Xin Tong, Yixuan Li, Jiayi Li, Rongqi Bei, Luyao Zhang

Minority groups have been using social media to organize social movements that create profound social impacts.

Ethics

ComplexGen: CAD Reconstruction by B-Rep Chain Complex Generation

1 code implementation • 29 May 2022 • Haoxiang Guo, Shilin Liu, Hao Pan, Yang Liu, Xin Tong, Baining Guo

We view the reconstruction of CAD models in the boundary representation (B-Rep) as the detection of geometric primitives of different orders, i.e., vertices, edges and surface patches, and the correspondence of primitives, which are holistically modeled as a chain complex, and show that by modeling such comprehensive structures more complete and regularized reconstructions can be achieved.

CAD Reconstruction

GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance Manifolds

no code implementations • ICCV 2023 • Jianfeng Xiang, Jiaolong Yang, Yu Deng, Xin Tong

This paper proposes a novel 3D-aware GAN that can generate high resolution images (up to 1024×1024) while keeping strict 3D consistency as in volume rendering.

Image Generation Super-Resolution

SDF-StyleGAN: Implicit SDF-Based StyleGAN for 3D Shape Generation

1 code implementation • 24 Jun 2022 • Xin-Yang Zheng, Yang Liu, Peng-Shuai Wang, Xin Tong

We further complement the evaluation metrics of 3D generative models with the shading-image-based Fréchet inception distance (FID) scores to better assess visual quality and shape distribution of the generated shapes.

3D Generation 3D Shape Generation +1

Environment Sensing Considering the Occlusion Effect: A Multi-View Approach

no code implementations • 2 Jul 2022 • Xin Tong, Zhaoyang Zhang, Yihan Zhang, Zhaohui Yang, Chongwen Huang, Kai-Kit Wong, Merouane Debbah

In this paper, we consider the problem of sensing the environment within a wireless cellular framework.

Sparse Ellipsometry: Portable Acquisition of Polarimetric SVBRDF and Shape with Unstructured Flash Photography

1 code implementation • 9 Jul 2022 • Inseung Hwang, Daniel S. Jeon, Adolfo Muñoz, Diego Gutierrez, Xin Tong, Min H. Kim

Ellipsometry techniques allow measuring polarization information of materials, requiring precise rotations of optical components with different configurations of lights and sensors.

Data Augmentation Inverse Rendering

Deep Deformable 3D Caricatures with Learned Shape Control

1 code implementation • 29 Jul 2022 • Yucheol Jung, Wonjong Jang, Soongjin Kim, Jiaolong Yang, Xin Tong, Seungyong Lee

To achieve the goal, we propose an MLP-based framework for building a deformable surface model, which takes a latent code and produces a 3D surface.

Caricature Position

Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation

1 code implementation • 9 Aug 2022 • ChunYu Sun, Xin Tong, Yang Liu

Our method exploits semantic segmentation to fuse nonlocal instance features, such as center prediction, and further enhances the fusion scheme in a multi- and cross-level way.

Ranked #1 on 3D Instance Segmentation on PartNet

3D Instance Segmentation 3D Part Segmentation +2

Generative Deformable Radiance Fields for Disentangled Image Synthesis of Topology-Varying Objects

no code implementations • 9 Sep 2022 • Ziyu Wang, Yu Deng, Jiaolong Yang, Jingyi Yu, Xin Tong

Experiments show that our method can successfully learn the generative model from unstructured monocular images and well disentangle the shape and appearance for objects (e.g., chairs) with large topological variance.

Disentanglement Image Generation +1

Hierarchical Neyman-Pearson Classification for Prioritizing Severe Disease Categories in COVID-19 Patient Data

no code implementations • 1 Oct 2022 • Lijia Wang, Y. X. Rachel Wang, Jingyi Jessica Li, Xin Tong

Here, we propose a hierarchical NP (H-NP) framework and an umbrella algorithm that generally adapts to popular classification methods and controls the under-classification errors with high probability.

Binary Classification Classification +1

AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars

1 code implementation • 12 Oct 2022 • Yue Wu, Yu Deng, Jiaolong Yang, Fangyun Wei, Qifeng Chen, Xin Tong

To achieve meaningful control over facial expressions via deformation, we propose a 3D-level imitative learning scheme between the generator and a parametric 3D face model during adversarial training of the 3D-aware GAN.

Disentanglement Face Model +1

NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation

no code implementations • CVPR 2023 • Yu Yin, Kamran Ghasedi, HsiangTao Wu, Jiaolong Yang, Xin Tong, Yun Fu

Furthermore, our method leverages explicit and implicit 3D regularizations using the in-domain neighborhood samples around the optimized latent code to remove geometrical and visual artifacts.

Image Animation

RemoteTouch: Enhancing Immersive 3D Video Communication with Hand Touch

no code implementations • 28 Feb 2023 • Yizhong Zhang, Zhiqi Li, Sicheng Xu, Chong Li, Jiaolong Yang, Xin Tong, Baining Guo

A key challenge in emulating the remote hand touch is the realistic rendering of the participant's hand and arm as the hand touches the screen.

ReBotNet: Fast Real-time Video Enhancement

no code implementations • 23 Mar 2023 • Jeya Maria Jose Valanarasu, Rahul Garg, Andeep Toor, Xin Tong, Weijuan Xi, Andreas Lugmayr, Vishal M. Patel, Anne Menini

The first branch learns spatio-temporal features by tokenizing the input frames along the spatial and temporal dimensions using a ConvNext-based encoder and processing these abstract tokens using a bottleneck mixer.

Video Enhancement Video Restoration

Synergistic Potential Functions from Single Modified Trace Function on SO(3)

no code implementations • 28 Mar 2023 • Xin Tong, Shing Shin Cheng

Second, it can be shown that for each potential function in the family, there exists a subset of the family such that the synergistic gap is positive at the unwanted critical points.

3D-aware Image Generation using 2D Diffusion Models

no code implementations • ICCV 2023 • Jianfeng Xiang, Jiaolong Yang, Binbin Huang, Xin Tong

In this paper, we introduce a novel 3D-aware image generation method that leverages 2D diffusion models.

Image Generation

3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining

no code implementations • 14 Apr 2023 • Siming Yan, YuQi Yang, YuXiao Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, QiXing Huang

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision.

Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding

2 code implementations • 14 Apr 2023 • Yu-Qi Yang, Yu-Xiao Guo, Jian-Yu Xiong, Yang Liu, Hao Pan, Peng-Shuai Wang, Xin Tong, Baining Guo

We pretrained a large Swin3D model on a synthetic Structured3D dataset, which is an order of magnitude larger than the ScanNet dataset.

Ranked #2 on 3D Object Detection on S3DIS (using extra training data)

3D Object Detection Scene Understanding +1

Locally Attentional SDF Diffusion for Controllable 3D Shape Generation

no code implementations • 8 May 2023 • Xin-Yang Zheng, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, Heung-Yeung Shum

Our method is built on a two-stage diffusion model.

3D Generation 3D Shape Generation

Global Stabilization of Antipodal Points on n-Sphere with Application to Attitude Tracking

no code implementations • 8 Jun 2023 • Xin Tong, Shing Shin Cheng

Existing approaches to robust global asymptotic stabilization of a pair of antipodal points on unit $n$-sphere $\mathbb{S}^n$ typically involve the non-centrally synergistic hybrid controllers for attitude tracking on unit quaternion space.

Relighting Neural Radiance Fields with Shadow and Highlight Hints

1 code implementation • 25 Aug 2023 • Chong Zeng, Guojun Chen, Yue Dong, Pieter Peers, Hongzhi Wu, Xin Tong

This paper presents a novel neural implicit radiance representation for free viewpoint relighting from a small set of unstructured photographs of an object lit by a moving point light source different from the view position.

Position

AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections

no code implementations • 5 Sep 2023 • Yue Wu, Sicheng Xu, Jianfeng Xiang, Fangyun Wei, Qifeng Chen, Jiaolong Yang, Xin Tong

For the new task, we base our method on the generative radiance manifold representation and equip it with learnable facial and head-shoulder deformations.

Adaptive conformal classification with noisy labels

1 code implementation • 10 Sep 2023 • Matteo Sesia, Y. X. Rachel Wang, Xin Tong

This paper develops novel conformal prediction methods for classification tasks that can automatically adapt to random label contamination in the calibration sample, leading to more informative prediction sets with stronger coverage guarantees compared to state-of-the-art approaches.

Classification Conformal Prediction

"I'm Not Confident in Debiasing AI Systems Since I Know Too Little": Teaching AI Creators About Gender Bias Through Hands-on Tutorials

no code implementations • 15 Sep 2023 • Kyrie Zhixuan Zhou, Jiaxun Cao, Xiaowen Yuan, Daniel E. Weissglass, Zachary Kilhoffer, Madelyn Rose Sanfilippo, Xin Tong

Gender bias is rampant in AI systems, causing bad user experience, injustices, and mental harm to women.

A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields

no code implementations • 9 Oct 2023 • Keyang Ye, Hongzhi Wu, Xin Tong, Kun Zhou

We present the first real-time method for inserting a rigid virtual object into a neural radiance field, which produces realistic lighting and shadowing effects, as well as allows interactive manipulation of the object.

Lighting Estimation Object

From Authority-Respect to Grassroots-Dissent: Degree-Weighted Social Learning and Convergence Speed

no code implementations • 13 Nov 2023 • Chen Cheng, Xiao Han, Xin Tong, Yusheng Wu, Yiqing Xing

Opinions are influenced by neighbors, with varying degrees of emphasis based on their connections.

Unity

SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation

1 code implementation • 29 Nov 2023 • Mutian Xu, Xingyilang Yin, Lingteng Qiu, Yang Liu, Xin Tong, Xiaoguang Han

We introduce SAMPro3D for zero-shot 3D indoor scene segmentation.

Scene Segmentation Scene Understanding +1

StructRe: Rewriting for Structured Shape Modeling

no code implementations • 29 Nov 2023 • Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Taku Komura, Wenping Wang

Such a localized rewriting process enables probabilistic modeling of ambiguous structures and robust generalization across object categories.

Object

Plan, Posture and Go: Towards Open-World Text-to-Motion Generation

no code implementations • 22 Dec 2023 • Jinpeng Liu, Wenxun Dai, Chunyu Wang, Yiji Cheng, Yansong Tang, Xin Tong

Some works use the CLIP model to align the motion space and the text space, aiming to enable motion generation from natural language motion descriptions.

Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis

1 code implementation • 4 Jan 2024 • Jiahui Peng, Jing Gao, Xin Tong, Jing Guo, Hang Yang, Jianchuan Qi, Ruiqiao Li, Nan Li, Ming Xu

In the evolving field of corporate sustainability, analyzing unstructured Environmental, Social, and Governance (ESG) reports is a complex challenge due to their varied formats and intricate content.

Paper
Code

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars

1 code implementation • 9 Jan 2024 • Ronglai Zuo, Fangyun Wei, Zenggui Chen, Brian Mak, Jiaolong Yang, Xin Tong

The objective of this paper is to develop a functional system for translating spoken languages into sign languages, referred to as Spoken2Sign translation.

Sign Language Translation Translation

CPSDBench: A Large Language Model Evaluation Benchmark and Baseline for Chinese Public Security Domain

no code implementations • 11 Feb 2024 • Xin Tong, Bo Jin, Zhi Lin, Binjun Wang, Ting Yu, Qiang Cheng

Large Language Models (LLMs) have demonstrated significant potential and effectiveness across multiple application domains.

Language Modelling Large Language Model +4

DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

no code implementations • 19 Feb 2024 • Chong Zeng, Yue Dong, Pieter Peers, Youkang Kong, Hongzhi Wu, Xin Tong

To provide the content creator with fine-grained control over the lighting during image generation, we augment the text-prompt with detailed lighting information in the form of radiance hints, i.e., visualizations of the scene geometry with a homogeneous canonical material under the target lighting.

Image Generation

MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion

no code implementations • 22 Feb 2024 • Xin-Yang Zheng, Hao Pan, Yu-Xiao Guo, Xin Tong, Yang Liu

By finetuning pretrained large image diffusion models with 3D data, the MVD methods first generate multiple views of a 3D object based on an image or text prompt and then reconstruct 3D shapes with multiview 3D reconstruction.

3D Generation 3D Reconstruction

Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

no code implementations • 18 Mar 2024 • Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang, Xin Tong

We propose a novel image editing technique that enables 3D manipulations on single images, such as object rotation and translation.

Novel View Synthesis

Quaternion-Based Attitude Stabilization Using Synergistic Hybrid Feedback With Minimal Potential Functions

no code implementations • 12 Apr 2024 • Xin Tong, Qingpeng Ding, Haiyang Fang, Shing Shin Cheng

This paper investigates the robust global attitude stabilization problem for a rigid-body system using quaternion-based feedback.

VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time

no code implementations • 16 Apr 2024 • Sicheng Xu, Guojun Chen, Yu-Xiao Guo, Jiaolong Yang, Chong Li, Zhenyu Zang, Yizhong Zhang, Xin Tong, Baining Guo

We introduce VASA, a framework for generating lifelike talking faces with appealing visual affective skills (VAS) given a single static image and a speech audio clip.

Learning Hierarchy-Aware Quaternion Knowledge Graph Embeddings with Representing Relations as 3D Rotations

1 code implementation • COLING 2022 • Jinfa Yang, Xianghua Ying, Yongjie Shi, Xin Tong, Ruibin Wang, Taiyan Chen, Bowei Xing

It is crucial for knowledge graph embedding models to model and infer various relation patterns, such as symmetry/antisymmetry.

Knowledge Graph Completion Knowledge Graph Embedding +2

Improving Knowledge Graph Embedding Using Affine Transformations of Entities Corresponding to Each Relation

no code implementations • Findings (EMNLP) 2021 • Jinfa Yang, Yongjie Shi, Xin Tong, Robin Wang, Taiyan Chen, Xianghua Ying

By using previous knowledge graph embedding methods, every entity in a knowledge graph is usually represented as a k-dimensional vector.

Knowledge Graph Embedding Relation

Knowledge Graph Embedding by Adaptive Limit Scoring Loss Using Dynamic Weighting Strategy

no code implementations • Findings (ACL) 2022 • Jinfa Yang, Xianghua Ying, Yongjie Shi, Xin Tong, Ruibin Wang, Taiyan Chen, Bowei Xing

The recently proposed Limit-based Scoring Loss independently limits the range of positive and negative triplet scores.

Knowledge Graph Embedding Knowledge Graphs +1
