no code implementations • ECCV 2020 • Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Yao-Wei Wang, Jinqiao Wang, Ming Tang
Most of existing object detectors usually adopt a small training batch size ( ~16), which severely hinders the whole community from exploring large-scale datasets due to the extremely long training procedure.
no code implementations • ECCV 2020 • Jibin Gao, Wei-Shi Zheng, Jia-Hui Pan, Chengying Gao, Yao-Wei Wang, Wei Zeng, Jian-Huang Lai
However, existing methods for action assessment are mostly limited to individual actions, especially lacking modeling of the asymmetric relations among agents (e. g., between persons and objects); and this limitation undermines their ability to assess actions containing asymmetrically interactive motion patterns, since there always exists subordination between agents in many interactive actions.
no code implementations • ECCV 2020 • Wei Zeng, Sezer Karaoglu, Theo Gevers
Leveraging the layout depth map as an intermediate representation, our proposed method outperforms existing methods for both panorama layout prediction and depth estimation.
no code implementations • 11 Feb 2025 • Haichuan Lin, Yilin Ye, Jiazhi Xia, Wei Zeng
Text-to-image models can generate visually appealing images from text descriptions.
1 code implementation • 19 Oct 2024 • Yin Li, Liangwei Wang, Shiyuan Piao, Boo-Ho Yang, Ziyue Li, Wei Zeng, Fugee Tsung
To address these challenges, we introduce MCCoder, an LLM-powered system designed to generate code that addresses complex motion control tasks, with integrated soft-motion data verification.
no code implementations • 3 Oct 2024 • Ziyao Gao, Yiwen Zhang, Ling Li, Theodoros Papatheodorou, Wei Zeng
Data surveillance has become more covert and pervasive with AI algorithms, which can result in biased social classifications.
1 code implementation • 29 Jul 2024 • Xingchen Zeng, Haichuan Lin, Yilin Ye, Wei Zeng
To fill the gap, we propose a visualization-referenced instruction tuning approach to guide the training dataset enhancement and model development.
no code implementations • 27 Jul 2024 • Tengyao Tu, Wei Zeng, Kun Zhao, Zhenyu Zhang
The result proves that adding a classifier to the model based on the random forest algorithm is very effective, and our model generally outperforms ordinary deep learning methods.
1 code implementation • 17 Jul 2024 • Yilin Ye, Shishi Xiao, Xingchen Zeng, Wei Zeng
Multi-modal embeddings form the foundation for vision-language models, such as CLIP embeddings, the most widely used text-image embeddings.
1 code implementation • 9 Jul 2024 • Fanyue Wei, Wei Zeng, Zhenyang Li, Dawei Yin, Lixin Duan, Wen Li
Personalized text-to-image models allow users to generate varied styles of images (specified with a sentence) for an object (specified with a set of reference images).
1 code implementation • 3 Jun 2024 • Ling Li, Yu Ye, Bingchuan Jiang, Wei Zeng
The data and code are available at https://github. com/lingli1996/GeoReasoner.
1 code implementation • 22 May 2024 • Wei Zeng, Xian He, Ye Wang
Piano audio-to-score transcription (A2S) is an important yet underexplored task with extensive applications for music composition, practice, and analysis.
no code implementations • 1 May 2024 • Zidong Cao, Zhan Wang, Yexin Liu, Yan-Pei Cao, Ying Shan, Wei Zeng, Lin Wang
Our system enables users to effortlessly locate and zoom in on the objects of interest in VR.
no code implementations • 28 Apr 2024 • Yilin Ye, Jianing Hao, Yihan Hou, Zhan Wang, Shishi Xiao, Yuyu Luo, Wei Zeng
From a technical perspective, this paper looks back on previous visualization studies leveraging GenAI and discusses the challenges and opportunities for future research.
1 code implementation • 14 Apr 2024 • Ya-Qi Yu, Minghui Liao, Jihao Wu, Yongxin Liao, Xiaoyu Zheng, Wei Zeng
We conduct extensive experiments on both general and document-oriented MLLM benchmarks, and show that TextHawk outperforms the state-of-the-art methods, demonstrating its effectiveness and superiority in fine-grained document perception and general abilities.
no code implementations • 20 Jan 2024 • Shishi Xiao, Liangwei Wang, Xiaojuan Ma, Wei Zeng
Semantic typographic logos harmoniously blend typeface and imagery to represent semantic concepts while maintaining legibility.
no code implementations • 4 Dec 2023 • Yilin Ye, Qian Zhu, Shishi Xiao, Kang Zhang, Wei Zeng
Moreover, the intent expansion framework enables users to perform flexible contextualized interactions with the search results to further specify or adjust their detailed search intents iteratively.
1 code implementation • 5 Aug 2023 • Xiangming Gu, Wei Zeng, Ye Wang
Leveraging the prior knowledge that pitch distributions may contribute to the gender bias, we propose conditionally aligning acoustic representations between demographic groups by feeding note events to the attribute predictor.
1 code implementation • 19 Jul 2023 • Jianing Hao, Qing Shi, Yilin Ye, Wei Zeng
Deep learning (DL) approaches are being increasingly used for time-series forecasting, with many efforts devoted to designing complex DL models.
1 code implementation • 28 Apr 2023 • Shishi Xiao, Suizi Huang, Yue Lin, Yilin Ye, Wei Zeng
Pictorial visualization seamlessly integrates data and semantic context into visual representation, conveying complex information in a manner that is both engaging and informative.
1 code implementation • 17 Apr 2023 • Yilin Ye, Rong Huang, Kang Zhang, Wei Zeng
The recent advances of AI technology, particularly in AI-Generated Content (AIGC), have enabled everyone to easily generate beautiful paintings with simple text description.
1 code implementation • 14 Apr 2023 • Shishi Xiao, Yihan Hou, Cheng Jin, Wei Zeng
Retrieving charts from a large corpus is a fundamental task that can benefit numerous applications such as visualization recommendations. The retrieved results are expected to conform to both explicit visual attributes (e. g., chart type, colormap) and implicit user intents (e. g., design style, context information) that vary upon application scenarios.
2 code implementations • 23 Mar 2023 • Xiang Gu, Yucheng Yang, Wei Zeng, Jian Sun, Zongben Xu
In this paper, we propose a novel KeyPoint-Guided model by ReLation preservation (KPG-RL) that searches for the optimal matching (i. e., transport plan) guided by the keypoints in OT.
no code implementations • 3 Apr 2022 • Juanhui Li, Yao Ma, Wei Zeng, Suqi Cheng, Jiliang Tang, Shuaiqiang Wang, Dawei Yin
In other words, GE-BERT can capture both the semantic information and the users' search behavioral information of queries.
no code implementations • 29 Mar 2022 • Jingting Zhang, Chengzhi Yuan, Wei Zeng, Cong Wang
This paper proposes a novel fault detection and isolation (FDI) scheme for distributed parameter systems modeled by a class of parabolic partial differential equations (PDEs) with nonlinear uncertain dynamics.
no code implementations • Neurocomputing 2022 • Ge Fan, Biao Geng, Jianrong Tao, Kai Wang, Changjie Fan, Wei Zeng
These methods may fail to capture the personalized informativeness of each vertex.
no code implementations • Neurocomputing 2022 • Ge Fan, Biao Geng, Jianrong Tao, Kai Wang, Changjie Fan, Wei Zeng
These methods may fail to capture the personalized informativeness of each vertex.
Ranked #1 on
Link Prediction
on PPI
1 code implementation • CVPR 2022 • Xiawu Zheng, Xiang Fei, Lei Zhang, Chenglin Wu, Fei Chao, Jianzhuang Liu, Wei Zeng, Yonghong Tian, Rongrong Ji
Building upon RMI, we further propose a new search algorithm termed RMI-NAS, facilitating with a theorem to guarantee the global optimal of the searched architecture.
3 code implementations • 23 Dec 2021 • Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, dianhai yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
A unified framework named ERNIE 3. 0 was recently proposed for pre-training large-scale knowledge enhanced models and trained a model with 10 billion parameters.
1 code implementation • 6 Dec 2021 • Yulong Ao, Zhihua Wu, dianhai yu, Weibao Gong, Zhiqing Kui, Minxu Zhang, Zilingfeng Ye, Liang Shen, Yanjun Ma, Tian Wu, Haifeng Wang, Wei Zeng, Chao Yang
The experiments demonstrate that our framework can satisfy various requirements from the diversity of applications and the heterogeneity of resources with highly competitive performance.
1 code implementation • 30 Jul 2021 • Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang
To address this problem, we propose a new Deformable Patch (DePatch) module which learns to adaptively split the images into patches with different positions and scales in a data-driven way rather than using predefined fixed patches.
Ranked #17 on
Semantic Segmentation
on DensePASS
no code implementations • 6 Jul 2021 • Mengyang Wu, Wei Zeng, Chi-Wing Fu
The ability to recognize the position and order of the floor-level lines that divide adjacent building floors can benefit many applications, for example, urban augmented reality (AR).
4 code implementations • 26 Apr 2021 • Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, ZhenZhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, YaoWei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
To enhance the generalization ability of PanGu-$\alpha$, we collect 1. 1TB high-quality Chinese data from a wide range of domains to pretrain the model.
Ranked #1 on
Reading Comprehension (One-Shot)
on DuReader
Cloze (multi-choices) (Few-Shot)
Cloze (multi-choices) (One-Shot)
+19
1 code implementation • CVPR 2021 • Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang
To address the problem of long-tail distribution for the large vocabulary object detection task, existing methods usually divide the whole categories into several groups and treat each group with different strategies.
1 code implementation • CVPR 2021 • Yaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, YaoWei Wang, Mingkui Tan
One of the key steps in Neural Architecture Search (NAS) is to estimate the performance of candidate architectures.
no code implementations • 31 Jan 2021 • Liqun Yang, Yijun Yang, Yao Wang, Zhenyu Yang, Wei Zeng
In the application of neural networks, we need to select a suitable model based on the problem complexity and the dataset scale.
no code implementations • 8 Jan 2021 • Wei Zeng, Chengqiao Lin, Kang Liu, Juncong Lin, Anthony K. H. Tung
Furthermore, to better fit with convolutions, we suggest to first aggregate traffic flows according to pre-conceived regions or self-organized regions based on traffic flows, then dispose to sequentially organized raster images for network input.
1 code implementation • 4 Dec 2020 • Shubham Rai, Walter Lau Neto, Yukio Miyasaka, Xinpei Zhang, Mingfei Yu, Qingyang Yi Masahiro Fujita, Guilherme B. Manske, Matheus F. Pontes, Leomar S. da Rosa Junior, Marilton S. de Aguiar, Paulo F. Butzen, Po-Chun Chien, Yu-Shan Huang, Hoa-Ren Wang, Jie-Hong R. Jiang, Jiaqi Gu, Zheng Zhao, Zixuan Jiang, David Z. Pan, Brunno A. de Abreu, Isac de Souza Campos, Augusto Berndt, Cristina Meinhardt, Jonata T. Carvalho, Mateus Grellert, Sergio Bampi, Aditya Lohana, Akash Kumar, Wei Zeng, Azadeh Davoodi, Rasit O. Topaloglu, Yuan Zhou, Jordan Dotzel, Yichi Zhang, Hanyu Wang, Zhiru Zhang, Valerio Tenace, Pierre-Emmanuel Gaillardon, Alan Mishchenko, Satrajit Chatterjee
If the function is incompletely-specified, the implementation has to be true only on the care set.
2 code implementations • 13 Aug 2020 • Ling-An Zeng, Fa-Ting Hong, Wei-Shi Zheng, Qi-Zhi Yu, Wei Zeng, Yao-Wei Wang, Jian-Huang Lai
However, most existing works focus only on video dynamic information (i. e., motion information) but ignore the specific postures that an athlete is performing in a video, which is important for action assessment in long videos.
Ranked #2 on
Action Quality Assessment
on Rhythmic Gymnastic
no code implementations • 30 Jul 2020 • Wei Zeng, Chengqiao Lin, Juncong Lin, Jincheng Jiang, Jiazhi Xia, Cagatay Turkay, Wei Chen
Deep learning methods are being increasingly used for urban traffic prediction where spatiotemporal traffic data is aggregated into sequentially organized matrices that are then fed into convolution-based residual neural networks.
no code implementations • 25 Jul 2020 • Haonan Jia, Xiao Zhang, Jun Xu, Wei Zeng, Hao Jiang, Xiaohui Yan, Ji-Rong Wen
Deep Q-learning algorithms often suffer from poor gradient estimations with an excessive variance, resulting in unstable training and poor sampling efficiency.
no code implementations • 20 Dec 2019 • Peng Gang, Wei Zeng, Yuri Gordienko, Oleksandr Rokovyi, Oleg Alienin, Sergii Stirenko
The classification problem was to predict the known level of the in-exercise loads (in three categories by calories) by the heart rate activity features measured during the short period of time (1 minute only) after training, i. e by features of the post-exercise load.
no code implementations • 31 Jul 2019 • Zhutian Chen, Wei Zeng, Zhiguang Yang, Lingyun Yu, Chi-Wing Fu, Huamin Qu
A hierarchical network is trained using a dataset with over 30K lasso-selection records on two different point cloud data.
Human-Computer Interaction Graphics
no code implementations • 23 Jul 2019 • Li He, Long Xia, Wei Zeng, Zhi-Ming Ma, Yihong Zhao, Dawei Yin
To make full use of such historical data, learning policies from multiple loggers becomes necessary.
no code implementations • 27 May 2019 • Ge Fan, Wei Zeng, Shan Sun, Biao Geng, Weiyi Wang, Weibo Liu
One advantage of deep neural network is that the performance of the algorithm can be easily enhanced by augmenting the depth of the neural network.
no code implementations • 4 Dec 2018 • Wei Zeng, Sezer Karaoglu, Theo Gevers
In this paper, we propose a pipeline to generate 3D point cloud of an object from a single-view RGB image.
no code implementations • 9 Nov 2018 • Wei Zeng, Azadeh Davoodi, Yu Hen Hu
Design rule check is a critical step in the physical design of integrated circuits to ensure manufacturability.
no code implementations • 31 Aug 2018 • Nikita Gordienko, Peng Gang, Yuri Gordienko, Wei Zeng, Oleg Alienin, Oleksandr Rokovyi, Sergii Stirenko
A new image dataset of these carved Glagolitic and Cyrillic letters (CGCL) was assembled and pre-processed for recognition and prediction by machine learning methods.
no code implementations • 14 Aug 2018 • Sergii Stirenko, Gang Peng, Wei Zeng, Yuri Gordienko, Oleg Alienin, Oleksandr Rokovyi, Nikita Gordienko
Several statistical and machine learning methods are proposed to estimate the type and intensity of physical load and accumulated fatigue .
no code implementations • 22 Apr 2018 • Guoxin Cui, Jun Xu, Wei Zeng, Yanyan Lan, Jiafeng Guo, Xue-Qi Cheng
One of the most significant bottleneck in training large scale machine learning models on parameter server (PS) is the communication overhead, because it needs to frequently exchange the model gradients between the workers and servers during the training iterations.
1 code implementation • 3 Mar 2018 • Sergii Stirenko, Yuriy Kochura, Oleg Alienin, Oleksandr Rokovyi, Peng Gang, Wei Zeng, Yuri Gordienko
Lossless data augmentation of the segmented dataset leads to the lowest validation loss (without overfitting) and nearly the same accuracy (within the limits of standard deviation) in comparison to the original and other pre-processed datasets after lossy data augmentation.
no code implementations • 19 Jan 2018 • Yu. Gordienko, Yu. Kochura, O. Alienin, O. Rokovyi, S. Stirenko, Peng Gang, Jiang Hui, Wei Zeng
Efficiency of some dimensionality reduction techniques, like lung segmentation, bone shadow exclusion, and t-distributed stochastic neighbor embedding (t-SNE) for exclusion of outliers, is estimated for analysis of chest X-ray (CXR) 2D images by deep learning approach to help radiologists identify marks of lung cancer in CXR.
no code implementations • 20 Dec 2017 • Yu. Gordienko, Peng Gang, Jiang Hui, Wei Zeng, Yu. Kochura, O. Alienin, O. Rokovyi, S. Stirenko
The recent progress of computing, machine learning, and especially deep learning, for image recognition brings a meaningful effect for automatic detection of various diseases from chest X-ray images (CXRs).
no code implementations • 30 Nov 2017 • Wei Zeng, Theo Gevers
Classification and segmentation of 3D point clouds are important tasks in computer vision.
no code implementations • ICCV 2017 • Ke Yan, Yonghong Tian, Yao-Wei Wang, Wei Zeng, Tiejun Huang
In this paper, we model the relationship of vehicle images as multiple grains.
no code implementations • CVPR 2014 • Wei Zeng, Lok Ming Lui, Xianfeng GU
The physically plausible constraints, in terms of feature landmarks and deformation types, define subspaces in the Beltrami coefficient space.
no code implementations • CVPR 2013 • Zhengyu Su, Wei Zeng, Rui Shi, Yalin Wang, Jian Sun, Xianfeng GU
Experimental results on caudate nucleus surface mapping and cortical surface mapping demonstrate the efficacy and efficiency of the proposed method.
no code implementations • CVPR 2013 • Rui Shi, Wei Zeng, Zhengyu Su, Hanna Damasio, Zhonglin Lu, Yalin Wang, Shing-Tung Yau, Xianfeng GU
This work conquer this problem by changing the Riemannian metric on the target surface to a hyperbolic metric, so that the harmonic mapping is guaranteed to be a diffeomorphism under landmark constraints.