1 code implementation • 11 Aug 2024 • Lei Zhou, Yuzhong Zhang, Jiadong Zhang, Xuejun Qian, Chen Gong, Kun Sun, Zhongxiang Ding, Xing Wang, Zhenhui Li, Zaiyi Liu, Dinggang Shen
To strike an optimal trade-off between computational costs and segmentation performance, we propose a hybrid network via the combination of convolution neural network (CNN) and transformer layers.
no code implementations • 22 May 2024 • Hongkai Chen, Zixin Luo, Yurun Tian, Xuyang Bai, Ziyu Wang, Lei Zhou, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan
Identifying robust and accurate correspondences across images is a fundamental problem in computer vision that enables various downstream tasks.
no code implementations • 12 May 2024 • Bin Lu, Ze Zhao, Luyu Han, Xiaoying Gan, Yuntao Zhou, Lei Zhou, Luoyi Fu, Xinbing Wang, Chenghu Zhou, Jing Zhang
Accurately reconstructing the global ocean deoxygenation over a century is crucial for assessing and protecting marine ecosystem.
1 code implementation • 30 Apr 2024 • Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou
Besides, previous LLM-based metrics ignore the fact that, within the space of LLM representations, there exist direction vectors that indicate the estimation of text quality.
no code implementations • 4 Apr 2024 • Lei Zhou, Haozhe Wang, Zhengshen Zhang, Zhiyang Liu, Francis EH Tay, adn Marcelo H. Ang. Jr
In the realm of robotic grasping, achieving accurate and reliable interactions with the environment is a pivotal challenge.
no code implementations • 21 Mar 2024 • Shuqian Sheng, Yi Xu, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu Zhou
The majority of automatic metrics for evaluating NLG systems are reference-based.
no code implementations • 11 Nov 2023 • Jingjie Wu, Lei Zhou
Precision motion systems are at the core of various manufacturing equipment.
1 code implementation • ICCV 2023 • Chaoqiang Zhao, Matteo Poggi, Fabio Tosi, Lei Zhou, Qiyu Sun, Yang Tang, Stefano Mattoccia
This paper tackles the challenges of self-supervised monocular depth estimation in indoor scenes caused by large rotation between frames and low texture.
no code implementations • 25 Sep 2023 • Jingjie Wu, Lei Zhou
In recent years, the drastically growing demand for higher throughput and reduced power consumption in various IC manufacturing equipment calls for the development of next-generation precision positioning systems with unprecedented acceleration capability while maintaining exceptional positioning accuracy and high control bandwidth.
no code implementations • 21 Sep 2023 • Jingjie Wu, Lei Zhou
For these systems, the motion control bandwidth is limited by the first structural resonance frequency of the stage, which enforces a fundamental trade-off between the stage's bandwidth and acceleration capability.
1 code implementation • 5 Sep 2023 • Lei Zhou, Zhiyang Liu, Runze Gan, Haozhe Wang, Marcelo H. Ang Jr
In the second stage, a novel registration network is designed to extract pose-sensitive features and predict the representation of object partial point cloud in canonical space based on the deformation results from the first stage.
1 code implementation • 31 Aug 2023 • Ruohuan Fang, Guansong Pang, Lei Zhou, Xiao Bai, Jin Zheng
Open-World Object Detection (OWOD) extends object detection problem to a realistic and dynamic scenario, where a detection model is required to be capable of detecting both known and unknown objects and incrementally learning newly introduced knowledge.
no code implementations • 29 Aug 2023 • Yi Xu, Junjie Ou, Hui Xu, Luoyi Fu, Lei Zhou, Xinbing Wang, Chenghu Zhou
To this end, we investigate the limits of historical information for temporal knowledge graph extrapolation and propose a new event forecasting model called Contrastive Event Network (CENET) based on a novel training framework of historical contrastive learning.
1 code implementation • 7 Jul 2023 • Tianqi Li, Guansong Pang, Xiao Bai, Jin Zheng, Lei Zhou, Xin Ning
Open-Set Recognition (OSR) is dedicated to addressing the unknown class issue, but existing OSR methods are not designed to model the semantic information of the unseen classes.
1 code implementation • 11 Mar 2023 • Lei Zhou, Huidong Liu, Joseph Bae, Junjun He, Dimitris Samaras, Prateek Prasanna
To this end, we reformulate segmentation as a sparse encoding -> token completion -> dense decoding (SCD) pipeline.
no code implementations • 21 Feb 2023 • Lu Liu, Lei Zhou, Yuhan Dong
This allows the camera to capture images with shallow depth-of-field, in which only a small area of the image is in sharp focus, while the rest of the image is blurred.
no code implementations • 10 Jan 2023 • Jingjie Wu, Lei Zhou
To overcome this challenge, this paper proposes a new hardware design and control framework for lightweight precision motion stages with the stage's low-frequency flexible modes actively controlled.
no code implementations • 14 Dec 2022 • Hongkuan Zhang, Saku Sugawara, Akiko Aizawa, Lei Zhou, Ryohei Sasano, Koichi Takeda
Moreover, the higher model performance on difficult examples and unseen data also demonstrates the generalization ability.
no code implementations • 15 Nov 2022 • Lei Zhou
Automated polyp segmentation technology plays an important role in diagnosing intestinal diseases, such as tumors and precancerous lesions.
Ranked #6 on Medical Image Segmentation on Kvasir-SEG
1 code implementation • 30 Aug 2022 • Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan
Generating robust and reliable correspondences across images is a fundamental task for a diversity of applications.
1 code implementation • 29 Jul 2022 • Zaiquan Yang, Yang Liu, Wenjia Xu, Chong Huang, Lei Zhou, Chao Tong
Specifically, we combine seen classes to hallucinate new classes which play as placeholders of the unseen classes in the visual and semantic space.
no code implementations • 12 May 2022 • YiWen Chen, Sheng Guo, Zedong Zhang, Lei Zhou, Xian Yao Ng, Marcelo H. Ang Jr
Previous methods achieved good performance on such manipulation tasks.
1 code implementation • 30 Mar 2022 • Haozhe Wang, Zhiyang Liu, Lei Zhou, Huan Yin, Marcelo H Ang Jr
Vision-based grasp estimation is an essential part of robotic manipulation tasks in the real world.
1 code implementation • 10 Mar 2022 • Lei Zhou, Huidong Liu, Joseph Bae, Junjun He, Dimitris Samaras, Prateek Prasanna
Masked Autoencoder (MAE) has recently been shown to be effective in pre-training Vision Transformers (ViT) for natural image analysis.
no code implementations • 15 Feb 2022 • Jingjie Wu, Lei Zhou
Precision motion stages are an essential part of a wide range of manufacturing equipment, and their motion performance are critical to the quality and throughput of the systems.
no code implementations • 9 Feb 2022 • Laura Homiller, Lei Zhou
Bearingless motors use a single stator assembly to apply torque and magnetic suspension forces on the rotor, making these machines compact with frictionless operation and thus well suited to high-speed applications.
1 code implementation • 18 Jan 2022 • Lei Zhou, Joseph Bae, Huidong Liu, Gagandeep Singh, Jeremy Green, Amit Gupta, Dimitris Samaras, Prateek Prasanna
Well-labeled datasets of chest radiographs (CXRs) are difficult to acquire due to the high cost of annotation.
no code implementations • 29 Sep 2021 • Huidong Liu, Ke Ma, Lei Zhou, Dimitris Samaras
If the \texttt{MRE} is smaller than 1, then every target point is guaranteed to have an area in the source distribution that is mapped to it.
no code implementations • 27 Sep 2021 • Abhishek Gupta, Lei Zhou, Yew-Soon Ong, Zefeng Chen, Yaqing Hou
Until recently, the potential to transfer evolved skills across distinct optimization problem instances (or tasks) was seldom explored in evolutionary computation.
1 code implementation • ICCV 2021 • Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan
2) Seeded Graph Neural Network, which utilizes seed matches to pass messages within/across images and predicts assignment costs.
no code implementations • ACL (IWSLT) 2021 • Lei Zhou, Liang Ding, Kevin Duh, Shinji Watanabe, Ryohei Sasano, Koichi Takeda
In the field of machine learning, the well-trained model is assumed to be able to recover the training labels, i. e. the synthetic labels predicted by the model should be as close to the ground-truth labels as possible.
no code implementations • 15 Mar 2021 • Liutong Zhang, Lei Zhou, Ruiyang Li, Xianyu Wang, Boxuan Han, Hongen Liao
In this paper, we pre-sent a cascaded feature warping network to perform the coarse-to-fine registration.
1 code implementation • CVPR 2021 • Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei LI, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai
Removing outlier correspondences is one of the critical steps for successful feature-based point cloud registration.
1 code implementation • CVPR 2021 • Yang Liu, Lei Zhou, Xiao Bai, Yifei HUANG, Lin Gu, Jun Zhou, Tatsuya Harada
Therefore, we introduce a novel goal-oriented gaze estimation module (GEM) to improve the discriminative attribute localization based on the class-level attributes for ZSL.
no code implementations • 26 Jan 2021 • Yizun He, Qingnan Cai, Lingjing Ji, Zhening Fang, Yuzhuo Wang, Liyang Qiu, Lei Zhou, Saijun Wu, Stefano Grava, Darrick E. Chang
The interaction between light and cold atoms is a complex phenomenon potentially featuring many-body resonant dipole interactions.
Atomic Physics Quantum Physics
no code implementations • WMT (EMNLP) 2020 • Lei Zhou, Liang Ding, Koichi Takeda
In response to this issue, we propose to expose explicit cross-lingual patterns, \textit{e. g.} word alignments and generation score, to our proposed zero-shot models.
no code implementations • 16 Sep 2020 • Yang Liu, Lei Zhou, Xiao Bai, Lin Gu, Tatsuya Harada, Jun Zhou
Though many ZSL methods rely on a direct mapping between the visual and the semantic space, the calibration deviation and hubness problem limit the generalization capability to unseen classes.
no code implementations • ECCV 2020 • Mingmin Zhen, Shiwei Li, Lei Zhou, Jiaxiang Shang, Haoan Feng, Tian Fang, Long Quan
In this paper, we introduce a novel network, called discriminative feature network (DFNet), to address the unsupervised video object segmentation task.
Ranked #1 on Video Object Segmentation on FBMS
1 code implementation • ECCV 2020 • Lei Zhou, Zixin Luo, Mingmin Zhen, Tianwei Shen, Shiwei Li, Zhuofei Huang, Tian Fang, Long Quan
In this work, we propose a stochastic bundle adjustment algorithm which seeks to decompose the RCS approximately inside the LM iterations to improve the efficiency and scalability.
1 code implementation • ECCV 2020 • Jiaxiang Shang, Tianwei Shen, Shiwei Li, Lei Zhou, Mingmin Zhen, Tian Fang, Long Quan
Recent learning-based approaches, in which models are trained by single-view images have shown promising results for monocular 3D face reconstruction, but they suffer from the ill-posed face pose and depth ambiguity issue.
Ranked #7 on 3D Face Reconstruction on REALY (side-view)
no code implementations • 26 May 2020 • XiangJi Wu, Ziwen Zhang, Jie Feng, Lei Zhou, Junmin Wu
We present an end-to-end trainable framework for P-frame compression in this paper.
no code implementations • CVPR 2020 • Mingmin Zhen, Jinglu Wang, Lei Zhou, Shiwei Li, Tianwei Shen, Jiaxiang Shang, Tian Fang, Quan Long
In this paper, we present a joint multi-task learning framework for semantic segmentation and boundary detection.
1 code implementation • CVPR 2020 • Lei Zhou, Zixin Luo, Tianwei Shen, Jiahui Zhang, Mingmin Zhen, Yao Yao, Tian Fang, Long Quan
Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization which focuses on a still image.
4 code implementations • CVPR 2020 • Zixin Luo, Lei Zhou, Xuyang Bai, Hongkai Chen, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan
This work focuses on mitigating two limitations in the joint learning of local feature detectors and descriptors.
2 code implementations • CVPR 2020 • Xuyang Bai, Zixin Luo, Lei Zhou, Hongbo Fu, Long Quan, Chiew-Lan Tai
In this paper, we leverage a 3D fully convolutional network for 3D point clouds, and propose a novel and practical learning mechanism that densely predicts both a detection score and a description feature for each 3D point.
Ranked #2 on Point Cloud Registration on KITTI
3 code implementations • CVPR 2020 • Yao Yao, Zixin Luo, Shiwei Li, Jingyang Zhang, Yufan Ren, Lei Zhou, Tian Fang, Long Quan
Compared with other computer vision tasks, it is rather difficult to collect a large-scale MVS dataset as it requires expensive active scanners and labor-intensive process to obtain ground truth 3D structures.
1 code implementation • 19 Sep 2019 • Tianwei Shen, Lei Zhou, Zixin Luo, Yao Yao, Shiwei Li, Jiahui Zhang, Tian Fang, Long Quan
The self-supervised learning of depth and pose from monocular sequences provides an attractive solution by using the photometric consistency of nearby frames as it depends much less on the ground-truth data.
1 code implementation • ICCV 2019 • Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen, Long Quan, Hongen Liao
First, to capture the local context of sparse correspondences, the network clusters unordered input correspondences by learning a soft assignment matrix.
no code implementations • 18 Jun 2019 • Dong Wang, Lei Zhou, Xiao Bai, Jun Zhou
Our method accelerates the network in one-step pruning-recovery manner with a novel optimization objective function, which achieves higher accuracy with much less cost compared with existing pruning methods.
no code implementations • 22 May 2019 • Mingmin Zhen, Jinglu Wang, Lei Zhou, Tian Fang, Long Quan
On the other hand, it learns more efficiently with the more efficient gradient backpropagation.
Ranked #79 on Semantic Segmentation on NYU Depth v2
1 code implementation • CVPR 2019 • Zixin Luo, Tianwei Shen, Lei Zhou, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan
Most existing studies on learning local features focus on the patch-based descriptions of individual keypoints, whereas neglecting the spatial relations established from their keypoint locations.
1 code implementation • 25 Feb 2019 • Tianwei Shen, Zixin Luo, Lei Zhou, Hanyu Deng, Runze Zhang, Tian Fang, Long Quan
Accurate relative pose is one of the key components in visual odometry (VO) and simultaneous localization and mapping (SLAM).
Ranked #3 on Camera Pose Estimation on KITTI Odometry Benchmark
1 code implementation • 26 Nov 2018 • Tianwei Shen, Zixin Luo, Lei Zhou, Runze Zhang, Siyu Zhu, Tian Fang, Long Quan
Convolutional Neural Networks (CNNs) have achieved superior performance on object image retrieval, while Bag-of-Words (BoW) models with handcrafted local features still dominate the retrieval of overlapping images in 3D reconstruction.
1 code implementation • ECCV 2018 • Zixin Luo, Tianwei Shen, Lei Zhou, Siyu Zhu, Runze Zhang, Yao Yao, Tian Fang, Long Quan
Learned local descriptors based on Convolutional Neural Networks (CNNs) have achieved significant improvements on patch-based benchmarks, whereas not having demonstrated strong generalization ability on recent benchmarks of image-based 3D reconstruction.
no code implementations • ECCV 2018 • Lei Zhou, Siyu Zhu, Zixin Luo, Tianwei Shen, Runze Zhang, Mingmin Zhen, Tian Fang, Long Quan
Critical to the registration of point clouds is the establishment of a set of accurate correspondences between points in 3D space.
no code implementations • CVPR 2018 • Siyu Zhu, Runze Zhang, Lei Zhou, Tianwei Shen, Tian Fang, Ping Tan, Long Quan
This work proposes a divide-and-conquer framework to solve very large global SfM at the scale of millions of images.
no code implementations • CVPR 2018 • Yali Wang, Lei Zhou, Yu Qiao
To mimic this capacity, we propose a novel Hybrid Video Memory (HVM) machine, which can hallucinate temporal features of still images from video memory, in order to boost action recognition with few still images.
no code implementations • 15 Mar 2018 • Lei Zhou, Xiao Bai, Xianglong Liu, Jun Zhou, Hancock Edwin
Therefore, the efficiency and scalability of traditional spectral clustering methods can not be guaranteed for large scale datasets.
no code implementations • 15 Mar 2018 • Dong Wang, Lei Zhou, Xueni Zhang, Xiao Bai, Jun Zhou
In this way, most of the representative information in the network can be retained in each cluster.
no code implementations • ICCV 2017 • Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan
In this paper, we propose a scale-invariant image matching approach to tackling the very large scale variation of views.
no code implementations • 12 Aug 2017 • Lei Zhou, Zhi Liu, Xiangjian He
In this work, we address the face parsing task with a Fully-Convolutional continuous CRF Neural Network (FC-CNN) architecture.
no code implementations • 28 Feb 2017 • Siyu Zhu, Tianwei Shen, Lei Zhou, Runze Zhang, Jinglu Wang, Tian Fang, Long Quan
In this paper, we tackle the accurate and consistent Structure from Motion (SfM) problem, in particular camera registration, far exceeding the memory of a single computer in parallel.