no code implementations • 24 Dec 2024 • Jiaxin Li, Weiqi Huang, Zan Wang, Wei Liang, Huijun Di, Feng Liu
To eliminate this gap, we introduce a novel navigation task: Floor Plan Visual Navigation (FloNa), the first attempt to incorporate floor plan into embodied visual navigation.
1 code implementation • 23 Dec 2024 • Jiawei Tan, Hongxing Wang, Kang Dang, Jiaxin Li, Zhilong Ou
Achieving this requires meticulously correlating multi-modal cues, e.g., visual entity and place modalities, among shots and comparing semantic changes around each shot.
no code implementations • 18 Dec 2024 • Zixuan Chen, Jiaxin Li, Liming Tan, Yejie Guo, Junxuan Liang, Cewu Lu, Yong-Lu Li
In light of this, we introduce the concept of phase in segmentation, which categorizes real-world objects based on their visual characteristics and potential morphological and appearance changes.
no code implementations • 28 Sep 2024 • Jiaxin Li, Gorka Abad, Stjepan Picek, Mauro Conti
If we convert ANNs trained with static datasets to SNNs, the accuracy of MIAs drops (maximum 11.5% with a reduction of 7.6% on the test accuracy of the target model).
no code implementations • 28 Sep 2024 • Jiaxin Li, Marco Arazzi, Antonino Nocera, Mauro Conti
Subject Membership Inference Attack (SMIA) targets this scenario and attempts to infer whether any client utilizes data points from a target subject in cross-silo FL.
1 code implementation • 4 Jun 2024 • Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, YuanYuan Huo, Dongya Jia, ChuMin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, Xudong Liu, Yuchen Liu, Zhengxi Liu, Lu Lu, Junjie Pan, Xin Wang, Yuping Wang, Yuxuan Wang, Zhen Wei, Jian Wu, Chao Yao, Yifeng Yang, YuanHao Yi, Junteng Zhang, Qidi Zhang, Shuo Zhang, Wenjie Zhang, Yang Zhang, Zilin Zhao, Dejian Zhong, Xiaobin Zhuang
Seed-TTS offers superior controllability over various speech attributes such as emotion and is capable of generating highly expressive and diverse speech for speakers in the wild.
no code implementations • 9 May 2024 • Zhenhui Jiang, Jiaxin Li, Yang Liu
This study provides a comprehensive comparative evaluation of American and Chinese LLMs in both English and Chinese contexts.
no code implementations • 10 Apr 2024 • Philip Anastassiou, Zhenyu Tang, Kainan Peng, Dongya Jia, Jiaxin Li, Ming Tu, Yuping Wang, Yuxuan Wang, Mingbo Ma
We present VoiceShop, a novel speech-to-speech framework that can modify multiple attributes of speech, such as age, gender, accent, and speech style, in a single forward pass while preserving the input speaker's timbre.
1 code implementation • 14 Feb 2024 • Huizhi Zhu, Wenxia Xu, Jian Huang, Jiaxin Li
When executed on a GPU, our two-stage method can meet the requirement for real-time computation.
1 code implementation • 17 Jan 2024 • Haixin Wang, Jiaxin Li, Anubhav Dwivedi, Kentaro Hara, Tailin Wu
Here we introduce Boundary-Embedded Neural Operators (BENO), a novel neural operator architecture that embeds the complex geometries and inhomogeneous boundary values into the solving of elliptic PDEs.
1 code implementation • CVPR 2024 • Jiawei Tan, Hongxing Wang, Jiaxin Li, Zhilong Ou, Zhangbin Qian
As a result, not only do the learned shot features suppress the affinity among similar shots from different scenes, but they also promote the affinity among dissimilar shots in the same scene.
Ranked #1 on Scene Segmentation on MovieNet (using extra training data)
1 code implementation • 14 Dec 2023 • Chubin Zhang, Juncheng Yan, Yi Wei, Jiaxin Li, Li Liu, Yansong Tang, Yueqi Duan, Jiwen Lu
Occupancy prediction reconstructs 3D structures of surrounding environments.
no code implementations • 7 Mar 2023 • Shangshang Shi, Zhimin Wang, Ruimin Shang, Yanan Li, Jiaxin Li, Guoqiang Zhong, Yongjian Gu
The taxonomic composition and abundance of phytoplankton, which have a direct impact on marine ecosystem dynamics and global environmental change, are listed as essential ocean variables.
1 code implementation • 7 Feb 2023 • Yanan Li, Zhimin Wang, Rongbing Han, Shangshang Shi, Jiaxin Li, Ruimin Shang, Haiyong Zheng, Guoqiang Zhong, Yongjian Gu
Quantum neural network (QNN) is one of the promising directions where the near-term noisy intermediate-scale quantum (NISQ) devices could find advantageous applications against classical resources.
no code implementations • 12 Dec 2022 • Dongya Jia, Qiao Tian, Kainan Peng, Jiaxin Li, Yuanzhe Chen, Mingbo Ma, Yuping Wang, Yuxuan Wang
The goal of accent conversion (AC) is to convert the accent of speech into the target accent while preserving the content and speaker identity.
no code implementations • 28 Oct 2022 • Mauro Conti, Jiaxin Li, Stjepan Picek
Membership Inference Attacks (MIAs) infer whether a data point is in the training data of a machine learning model.
no code implementations • 27 Oct 2022 • Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang
In this paper, we propose to use intermediate bottleneck features (IBFs) to replace PPGs.
no code implementations • 27 Jul 2022 • Mauro Conti, Jiaxin Li, Stjepan Picek, Jing Xu
Even in those scenarios, our label-only MIA achieves a better attack performance in most cases.
2 code implementations • 29 Jun 2022 • Yining Shi, Jingyan Shen, Yifan Sun, Yunlong Wang, Jiaxin Li, Shiqi Sun, Kun Jiang, Diange Yang
Our novel sparse feature sampling module only utilizes local 2D region of interest (RoI) features calculated by the projection of 3D query boxes for further box refinement, leading to a fully-convolutional and deployment-friendly pipeline.
Ranked #8 on Robust Camera Only 3D Object Detection on nuScenes-C
no code implementations • 13 May 2022 • Minghua Wang, Danfeng Hong, Zhu Han, Jiaxin Li, Jing Yao, Lianru Gao, Bing Zhang, Jocelyn Chanussot
Owing to the rapid development of sensor technology, hyperspectral (HS) remote sensing (RS) imaging has provided a significant amount of spatial and spectral information for the observation and analysis of the Earth's surface from a distance, using data acquisition devices such as aircraft, spacecraft, and satellites.
no code implementations • 3 May 2022 • Jiaxin Li, Danfeng Hong, Lianru Gao, Jing Yao, Ke Zheng, Bing Zhang, Jocelyn Chanussot
With the extremely rapid advances in remote sensing (RS) technology, a great quantity of Earth observation (EO) data featuring considerable and complicated heterogeneity is readily available nowadays, which gives researchers an opportunity to tackle current geoscience applications in a fresh way.
1 code implementation • 28 Mar 2022 • Yi Wei, Zibu Wei, Yongming Rao, Jiaxin Li, Jie Zhou, Jiwen Lu
In this paper, we propose the LiDAR Distillation to bridge the domain gap induced by different LiDAR beams for 3D object detection.
6 code implementations • 8 Mar 2022 • Jiaxin Li, Yan Ding, HuaLiang Wei
Joint detection and embedding (JDE) based methods usually estimate bounding boxes and embedding features of objects with a single network in Multi-Object Tracking (MOT).
no code implementations • 10 Nov 2021 • Jiaxin Li, Yan Ding, Weizhong Zhang, Yifan Zhao, Lingxi Guo, Zhe Yang
Augmented reality technology based on image registration is becoming increasingly popular for the convenience of pre-surgery preparation and medical education.
no code implementations • 30 Sep 2021 • Fengrui Liu, Yang Li, Baitong Li, Jiaxin Li, Huiyang Xie
Then an automatically generated transaction strategy is constructed, building on PPO with an LSTM as the basis of the policy.
1 code implementation • ICCV 2021 • Panhe Feng, Qi She, Lei Zhu, Jiaxin Li, Lin Zhang, Zijian Feng, Changhu Wang, Chunpeng Li, Xuejing Kang, Anlong Ming
Retrieving occlusion relations among objects in a single image is challenging due to the sparsity of boundaries in the image.
no code implementations • 10 Aug 2021 • Hongwu Peng, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam, Tong Geng, Ang Li, Wei Zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding
Deep complex networks (DCN), in contrast, can learn from complex data, but have high computational costs; therefore, they cannot satisfy the instant decision-making requirements of many deployable systems dealing with short observations or short signal bursts.
1 code implementation • CVPR 2021 • Jiaxin Li, Gim Hee Lee
This paper presents DeepI2P: a novel approach for cross-modality registration between an image and a point cloud.
1 code implementation • ICCV 2021 • Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee
In this paper, we propose MINE to perform novel view synthesis and depth estimation via dense 3D reconstruction from a single image.
no code implementations • 15 Feb 2021 • Yiming Xu, Jiaxin Li, Yiheng Peng, Yan Ding, Hua-Liang Wei
Both types of methods involve two stages, namely person detection and joint detection.
no code implementations • ICCV 2021 • Henghui Ding, Hui Zhang, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang
In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.
no code implementations • SEMEVAL 2020 • Jinan Zhou, Jiaxin Li
This paper describes our TemporalTeller system for SemEval Task 1: Unsupervised Lexical Semantic Change Detection.
1 code implementation • 31 Mar 2019 • Jiaxin Li, Yingcai Bi, Gim Hee Lee
In this paper, we propose a deep learning architecture that achieves discrete $\mathbf{SO}(2)$/$\mathbf{SO}(3)$ rotation equivariance for point cloud recognition.
1 code implementation • ICCV 2019 • Jiaxin Li, Gim Hee Lee
In this paper, we propose the USIP detector: an Unsupervised Stable Interest Point detector that can detect highly repeatable and accurately localized keypoints from 3D point clouds under arbitrary transformations without the need for any ground truth training data.
1 code implementation • 28 Jul 2018 • Jiaxin Li, Yingcai Bi, Kun Li, Kangli Wang, Feng Lin, Ben M. Chen
Driven by applications like Micro Aerial Vehicles (MAVs), driverless cars, etc., localization solutions have become an active research topic in the past decade.
3 code implementations • CVPR 2018 • Jiaxin Li, Ben M. Chen, Gim Hee Lee
This paper presents SO-Net, a permutation invariant architecture for deep learning with orderless point clouds.
Ranked #3 on 3D Part Segmentation on IntrA