no code implementations • 14 Oct 2024 • Jiazhi Guan, Quanwei Yang, Kaisiyuan Wang, Hang Zhou, Shengyi He, Zhiliang Xu, Haocheng Feng, Errui Ding, Jingdong Wang, Hongtao Xie, Youjian Zhao, Ziwei Liu
We propose a Motion-Enhanced Textural Alignment module to enhance the bond between driving and target signals.
no code implementations • 6 Aug 2024 • Jiazhi Guan, Zhiliang Xu, Hang Zhou, Kaisiyuan Wang, Shengyi He, Zhanwang Zhang, Borong Liang, Haocheng Feng, Errui Ding, Jingtuo Liu, Jingdong Wang, Youjian Zhao, Ziwei Liu
Lip-syncing videos with given audio is the foundation for various applications including the creation of virtual presenters or performers.
1 code implementation • 19 May 2024 • Tongze Wang, Xiaohui Xie, Wenduo Wang, Chuyi Wang, Youjian Zhao, Yong Cui
In addition, we design a traffic representation scheme to extract valid information from massive traffic data while removing biased information.
no code implementations • 1 Apr 2024 • Bo Zou, Shaofeng Wang, Hao liu, Gaoyue Sun, Yajie Wang, FeiFei Zuo, Chengbin Quan, Youjian Zhao
Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health.
no code implementations • CVPR 2024 • Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao
LLaMA-Excitor ensures a self-adaptive allocation of additional attention to input instructions, thus effectively preserving LLMs' pre-trained knowledge when fine-tuning LLMs on low-quality instruction-following datasets.
no code implementations • 1 Apr 2024 • Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao
In this paper, we are inspired by the human recognition and learning pattern and propose VideoDistill, a framework with language-aware (i. e., goal-driven) behavior in both vision perception and answer generation process.
no code implementations • CVPR 2024 • Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao
In this paper we are inspired by the human recognition and learning pattern and propose VideoDistill a framework with language-aware (i. e. goal-driven) behavior in both vision perception and answer generation process.
no code implementations • CVPR 2024 • Bo Zou, Shaofeng Wang, Hao liu, Gaoyue Sun, Yajie Wang, FeiFei Zuo, Chengbin Quan, Youjian Zhao
Teeth localization segmentation and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics treatment planning and population-based studies on oral health.
no code implementations • 22 May 2023 • Jiazhi Guan, Tianshu Hu, Hang Zhou, Zhizhi Guo, Lirui Deng, Chengbin Quan, Errui Ding, Youjian Zhao
Unlike authentic images, where the hidden messages can be extracted with precision, manipulating the facial attributes through deepfake techniques can disrupt the decoding process.
no code implementations • 21 Jul 2022 • Jiazhi Guan, Hang Zhou, Mingming Gong, Errui Ding, Jingdong Wang, Youjian Zhao
Specifically, by carefully examining the spatial and temporal properties, we propose to disrupt a real video through a Pseudo-fake Generator and create a wide range of pseudo-fake videos for training.
no code implementations • 6 Jul 2022 • Jiazhi Guan, Hang Zhou, Zhibin Hong, Errui Ding, Jingdong Wang, Chengbin Quan, Youjian Zhao
Recent advances in face forgery techniques produce nearly visually untraceable deepfake videos, which could be leveraged with malicious intentions.
1 code implementation • 6 Jun 2022 • Yunsheng Ni, Depu Meng, Changqian Yu, Chengbin Quan, Dongchun Ren, Youjian Zhao
Specifically, we first capture the different representations with different augmentations, then regularize the cosine distance of the representations to enhance the consistency.
no code implementations • 25 Sep 2019 • Haowen Xu, Wenxiao Chen, Jinlin Lai, Zhihan Li, Youjian Zhao, Dan Pei
Using powerful posterior distributions is a popular technique in variational inference.
no code implementations • 31 May 2019 • Haowen Xu, Wenxiao Chen, Jinlin Lai, Zhihan Li, Youjian Zhao, Dan Pei
Using powerful posterior distributions is a popular approach to achieving better variational inference.
9 code implementations • 12 Feb 2018 • Haowen Xu, Wenxiao Chen, Nengwen Zhao, Zeyan Li, Jiahao Bu, Zhihan Li, Ying Liu, Youjian Zhao, Dan Pei, Yang Feng, Jie Chen, Zhaogang Wang, Honglin Qiao
To ensure undisrupted business, large Internet companies need to closely monitor various KPIs (e. g., Page Views, number of online users, and number of orders) of its Web applications, to accurately detect anomalies and trigger timely troubleshooting/mitigation.