1 code implementation • 28 Oct 2024 • Weizhe Chen, Zhicheng Zhang, Guanlin Liu, Renjie Zheng, Wenlei Shi, Chen Dun, Zheng Wu, Xing Jin, Lin Yan
Since the release of ChatGPT, large language models (LLMs) have demonstrated remarkable capabilities across various domains.
no code implementations • 28 Sep 2024 • Jiwei Tang, Jin Xu, Tingwei Lu, Zhicheng Zhang, Yiming Zhao, Lin Hai, Hai-Tao Zheng
Large language models (LLMs) demonstrate exceptional capabilities in various scenarios.
1 code implementation • 10 Sep 2024 • Zehao Wang, Haobo Yue, Zhicheng Zhang, Da Mu, Jin Tang, Jianqin Yin
Sound Event Detection (SED) plays a vital role in comprehending and perceiving acoustic scenes.
no code implementations • 4 Sep 2024 • Jialong Li, Zhicheng Zhang, Yunwei Chen, Qiqi Lu, Ye Wu, Xiaoming Liu, Qianjin Feng, Yanqiu Feng, Xinyuan Zhang
The former fits DW images from diverse acquisition settings into diffusion tensor field, while the latter applies a deep learning-based denoiser to regularize the diffusion tensor field instead of the DW images, which is free from the limitation of fixed-channel assignment of the network.
1 code implementation • 30 Aug 2024 • Mingyuan Zhang, Zhicheng Zhang, Hao Wu, Yong Wang
We present flow matching for reaction coordinates (FMRC), a novel deep learning algorithm designed to identify optimal reaction coordinates (RC) in biomolecular reversible dynamics.
no code implementations • 9 Aug 2024 • Da Mu, Zhicheng Zhang, Haobo Yue, Zehao Wang, Jin Tang, Jianqin Yin
In the Sound Event Localization and Detection (SELD) task, Transformer-based models have demonstrated impressive capabilities.
1 code implementation • 7 Aug 2024 • Xiangyan Liu, Bo Lan, Zhiyuan Hu, Yang Liu, Zhicheng Zhang, Fei Wang, Michael Shieh, Wenmeng Zhou
Similarity-based retrieval often has low recall in complex tasks, while manual tools and APIs are typically task-specific and require expert knowledge, reducing their generalizability across diverse code tasks and real-world applications.
no code implementations • 18 Jun 2024 • Xinquan Yang, Guanqun Zhou, Wei Sun, Youjian Zhang, Zhongya Wang, Jiahui He, Zhicheng Zhang
In this paper, we have discovered that the uncertainty image computed from the restoration result of initial training weights can effectively highlight high-frequency regions, including metal artifacts.
1 code implementation • 13 Jun 2024 • Da Mu, Zhicheng Zhang, Haobo Yue
This paper proposes a three-stage network structure named Multi-scale Feature Fusion (MFF) module to fully extract multi-scale features across spectral, spatial, and temporal domains.
no code implementations • 12 Jun 2024 • Ren Zhang, Jianqin Yin, Chao Qi, Zehao Wang, Zhicheng Zhang, Yonghao Dang
Conversely, depth information can effectively represent motion information related to facial structure changes and is not affected by lighting.
1 code implementation • 21 May 2024 • Zhicheng Zhang, Yong Wang, Shaoqi Tan, Bowei Xia, Yujie Luo
Recently, Transformer-based models for long sequence time series forecasting have demonstrated promising results.
no code implementations • 1 May 2024 • Zhicheng Zhang, Yancheng Liang, Yi Wu, Fei Fang
It learns to explore by first identifying the agents' high-rewarding joint state-action subspace from training tasks and then learning a set of diverse exploration policies to "cover" the subspace.
no code implementations • 20 Apr 2024 • Yuheng Ji, Yue Liu, Zhicheng Zhang, Zhao Zhang, YuTing Zhao, Gang Zhou, Xingwei Zhang, Xinwang Liu, Xiaolong Zheng
Different from LoRA, we improve the efficiency and robustness of adversarial adaptation by designing a novel reparameterizing method based on parameter clustering and parameter alignment.
no code implementations • 15 Apr 2024 • Yuan Bi, Cheng Qian, Zhicheng Zhang, Nassir Navab, Zhongliang Jiang
Ultrasound (US) has been widely used in daily clinical practice for screening internal organs and guiding interventions.
1 code implementation • CVPR 2024 • Pancheng Zhao, Peng Xu, Pengda Qin, Deng-Ping Fan, Zhicheng Zhang, Guoli Jia, BoWen Zhou, Jufeng Yang
Camouflaged vision perception is an important vision task with numerous practical applications.
1 code implementation • 10 Jan 2024 • Haobo Yue, Zhicheng Zhang, Da Mu, Yonghao Dang, Jianqin Yin, Jin Tang
Recently, 2D convolution has been found unqualified in sound event detection (SED).
no code implementations • CVPR 2024 • Zhicheng Zhang, Junyao Hu, Wentao Cheng, Danda Paudel, Jufeng Yang
Video prediction is a challenging task due to its nature of uncertainty especially for forecasting a long period.
no code implementations • CVPR 2024 • Zhicheng Zhang, Pancheng Zhao, Eunil Park, Jufeng Yang
Inspired by psychology research and empirical theory we verify that the degree of emotion may vary in different segments of the video thus introducing the sentiment complementary and emotion intrinsic among temporal segments.
Multimodal Emotion Recognition
Multimodal Sentiment Analysis
+2
no code implementations • 17 Nov 2023 • Zhicheng Zhang, Xueyao Sun, Yonghao Dang, Jianqin Yin
On the challenging of COCO dataset, the proposed method enables the binary neural network to achieve 70. 8 mAP, which is better than most tested lightweight full-precision networks.
1 code implementation • 1 Oct 2023 • Xiangyu Zeng, Jie Lin, Piao Hu, Ruizheng Huang, Zhicheng Zhang
How humans and machines make sense of current inputs for relation reasoning and question-answering while putting the perceived information into context of our past memories, has been a challenging conundrum in cognitive science and artificial intelligence.
3 code implementations • 2 Sep 2023 • Chenliang Li, Hehong Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou
Large language models (LLMs) have recently demonstrated remarkable capabilities to comprehend human intentions, engage in reasoning, and design planning-like behavior.
1 code implementation • 21 Jun 2023 • Chengxu Duan, Zhicheng Zhang, Xiaoli Liu, Yonghao Dang, Jianqin Yin
Specifically, we introduce a novel adaptable scheme that facilitates the attack to suit the scale of the target pose and two physical constraints to enhance the naturalness of the adversarial example.
no code implementations • 27 Mar 2023 • Zhicheng Zhang, Yasumasa Fujisaki
In this paper, we explore the discrete time sparse feedback control for a linear invariant system, where the proposed optimal feedback controller enjoys input sparsity by using a dynamic linear compensator, i. e., the components of feedback control signal having the smallest possible nonzero values.
1 code implementation • 30 Jan 2023 • Hong-Yu Zhou, Yunxiang Fu, Zhicheng Zhang, Cheng Bian, Yizhou Yu
Protein representation learning has primarily benefited from the remarkable development of language models (LMs).
no code implementations • ICCV 2023 • Zhicheng Zhang, Shengzhe Liu, Jufeng Yang
Specifically, we present a dual-branch network to track the visible part of planar objects, including vertexes and mask.
1 code implementation • CVPR 2023 • Zhicheng Zhang, Lijuan Wang, Jufeng Yang
Automatically predicting the emotions of user-generated videos (UGVs) receives increasing interest recently.
Ranked #3 on
Video Emotion Recognition
on Ekman6
no code implementations • 31 Dec 2022 • Xiaofa Liu, Jianqin Yin, Yuan Sun, Zhicheng Zhang, Jin Tang
Unlike most existing methods with offline feature generation, our method directly takes frames as input and further models motion evolution on two different temporal scales. Therefore, we solve the complexity problems of the two stages of modeling and the problem of insufficient temporal and spatial information of a single scale.
no code implementations • 27 Oct 2022 • Zhicheng Zhang, Zhiqiang Zuo, Xiang Chen, Ying Tan, Yijing Wang
The output regulation scheme is utilized in the framework to track the reference in the presence of modeled disturbance, and the effect of unmodeled disturbance is reduced by an $\mathcal{H}_\infty$ compensator.
1 code implementation • 1 Sep 2022 • Zhixiong Yang, Junwen Pan, Yanzhan Yang, Xiaozhou Shi, Hong-Yu Zhou, Zhicheng Zhang, Cheng Bian
The overall framework, namely as Prototype-aware Contrastive learning (ProCo), is unified as a single-stage pipeline in an end-to-end manner to alleviate the imbalanced problem in medical image classification, which is also a distinct progress than existing works as they follow the traditional two-stage pipeline.
no code implementations • 16 Aug 2022 • Chulong Zhang, Yuming Jiang, Na Li, Zhicheng Zhang, Md Tauhidul Islam, Jingjing Dai, Lin Liu, Wenfeng He, Wenjian Qin, Jing Xiong, Yaoqin Xie, Xiaokun Liang
Deformable image registration is a necessary technique for fusing multi-modal pathology slices.
no code implementations • 21 Jun 2022 • Junwen Pan, Guanlin Chen, Yi Liu, Jiexiang Wang, Cheng Bian, Pengfei Zhu, Zhicheng Zhang
Answer grounding aims to reveal the visual evidence for visual question answering (VQA), which entails highlighting relevant positions in the image when answering questions about images.
no code implementations • 25 May 2022 • Stephanie Milani, Zhicheng Zhang, Nicholay Topin, Zheyuan Ryan Shi, Charles Kamhoua, Evangelos E. Papalexakis, Fei Fang
The first algorithm, IVIPER, extends VIPER, a recent method for single-agent interpretable RL, to the multi-agent setting.
Multi-agent Reinforcement Learning
reinforcement-learning
+2
no code implementations • 20 Mar 2022 • Xihuai Wang, Zhicheng Zhang, Weinan Zhang
Significant advances have recently been achieved in Multi-Agent Reinforcement Learning (MARL) which tackles sequential decision-making problems involving multiple participants.
no code implementations • 14 Mar 2022 • Yan Yan, Xuankun Wu, Chengdong Li, Yini He, Zhicheng Zhang, Huihui Li, Ang Li, Lei Wang
The proposed work is the first investigation in the emotion recognition oriented EEG topological feature analysis, which brought a novel insight into the brain neural system nonlinear dynamics analysis and feature extraction.
no code implementations • 17 Nov 2021 • Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Weihua Chen, Xianzhe Xu, Fan Wang, Zheng Cao, Zhicheng Zhang, Qiyu Zhang, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin
The Visual Question Answering (VQA) task utilizes both visual image and language analysis to answer a textual question with respect to an image.
Ranked #8 on
Visual Question Answering (VQA)
on VQA v2 test-dev
no code implementations • 28 Sep 2021 • Lequan Yu, Zhicheng Zhang, Xiaomeng Li, Hongyi Ren, Wei Zhao, Lei Xing
We then design a novel FBP reconstruction loss to encourage the network to generate more perfect completion results and a residual-learning-based image refinement module to reduce the secondary artifacts in the reconstructed CT images.
1 code implementation • 13 May 2021 • Menghui Zhu, Minghuan Liu, Jian Shen, Zhicheng Zhang, Sheng Chen, Weinan Zhang, Deheng Ye, Yong Yu, Qiang Fu, Wei Yang
In Goal-oriented Reinforcement learning, relabeling the raw goals in past experience to provide agents with hindsight ability is a major solution to the reward sparsity problem.
1 code implementation • 28 Feb 2021 • Zhicheng Zhang, Lequan Yu, Xiaokun Liang, Wei Zhao, Lei Xing
Low dose computed tomography (LDCT) has attracted more and more attention in routine clinical diagnosis assessment, therapy planning, etc., which can reduce the dose of X-ray radiation to patients.
no code implementations • 1 Jan 2021 • Xin Ma, Zhicheng Zhang, Danfeng Wang, Yu Luo, Hui Yuan
In deep learning-based local stereo matching methods, larger image patches usually bring better stereo matching accuracy.
no code implementations • 30 Dec 2020 • Shaode Yu, Haobo Chen, Hang Yu, Zhicheng Zhang, Xiaokun Liang, Wenjian Qin, Yaoqin Xie, Ping Shi
After features are sorted according to their frequency, linear support vector machine performs the classification in an incremental manner.
no code implementations • 16 Dec 2020 • Zhicheng Zhang, Shaode Yu, Wenjian Qin, Xiaokun Liang, Yaoqin Xie, Guohua Cao
We incorporated the CT domain knowledge into the SADIR and unrolled it into a DL network (SADIR Net).
no code implementations • 16 Sep 2020 • Lequan Yu, Zhicheng Zhang, Xiaomeng Li, Lei Xing
Computed tomography (CT) has been widely used for medical diagnosis, assessment, and therapy planning and guidance.
1 code implementation • 10 Mar 2020 • Shaode Yu, Zhicheng Zhang, Xiaokun Liang, Junjie Wu, Erlei Zhang, Wenjian Qin, Yaoqin Xie
Moreover, the toolbox is evaluated on a database of 163 ultrasound images.