Search Results for author: Jingxiang Sun

Found 15 papers, 7 papers with code

Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer

no code implementations • 27 May 2024 • Ruizhi Shao, Youxin Pang, Zerong Zheng, Jingxiang Sun, Yebin Liu

We present a novel approach for generating high-quality, spatio-temporally coherent human videos from a single image under arbitrary viewpoints.

DeepSeek-VL: Towards Real-World Vision-Language Understanding

2 code implementations • 8 Mar 2024 • Haoyu Lu, Wen Liu, Bo Zhang, Bingxuan Wang, Kai Dong, Bo Liu, Jingxiang Sun, Tongzheng Ren, Zhuoshu Li, Hao Yang, Yaofeng Sun, Chengqi Deng, Hanwei Xu, Zhenda Xie, Chong Ruan

The DeepSeek-VL family (both 1.3B and 7B models) showcases superior user experiences as a vision-language chatbot in real-world applications, achieving state-of-the-art or competitive performance across a wide range of visual-language benchmarks at the same model size while maintaining robust performance on language-centric benchmarks.

Ranked #31 on Visual Question Answering on MM-Vet

Chatbot Language Modelling +3

1,765

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

1 code implementation • 5 Jan 2024 • DeepSeek-AI: Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, JianZhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, A. X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou

The rapid development of open-source large language models (LLMs) has been truly remarkable.

1,256

VectorTalker: SVG Talking Face Generation with Progressive Vectorisation

no code implementations • 18 Dec 2023 • Hao Hu, Xuan Wang, Jingxiang Sun, Yanbo Fan, Yu Guo, Caigui Jiang

To address these, we propose a novel scalable vector graphic reconstruction and animation method, dubbed VectorTalker.

Image Reconstruction Talking Face Generation +1

InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

no code implementations • 3 Dec 2023 • Xiaochen Zhao, Jingxiang Sun, Lizhen Wang, Jinli Suo, Yebin Liu

While high fidelity and efficiency are central to the creation of digital head avatars, recent methods relying on 2D or 3D generative models often experience limitations such as shape distortion, expression inaccuracy, and identity flickering.

Image-to-Image Translation

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

1 code implementation • 25 Oct 2023 • Jingxiang Sun, Bo Zhang, Ruizhi Shao, Lizhen Wang, Wen Liu, Zhenda Xie, Yebin Liu

The score distillation from this 3D-aware diffusion prior provides view-consistent guidance for the scene.

3D Generation

1,842

HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

no code implementations • 29 Sep 2023 • Xiaochen Zhao, Lizhen Wang, Jingxiang Sun, Hongwen Zhang, Jinli Suo, Yebin Liu

The problem of modeling an animatable 3D human head avatar under light-weight setups is of significant importance but has not been well solved.

Image-to-Image Translation

Control4D: Efficient 4D Portrait Editing with Text

no code implementations • 31 May 2023 • Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu

We introduce Control4D, an innovative framework for editing dynamic 4D portraits using text instructions.

StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

1 code implementation • 1 May 2023 • Lizhen Wang, Xiaochen Zhao, Jingxiang Sun, Yuxiang Zhang, Hongwen Zhang, Tao Yu, Yebin Liu

Results and experiments demonstrate the superiority of our method in terms of image quality, full portrait video generation, and real-time re-animation compared to existing facial reenactment methods.

Face Reenactment Translation +1

378

High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors

no code implementations • CVPR 2023 • Yunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan

Compared with existing works, we obtain superior novel view synthesis results and faithful face reenactment performance.

Face Reenactment Novel View Synthesis +1

Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars

2 code implementations • CVPR 2023 • Jingxiang Sun, Xuan Wang, Lizhen Wang, Xiaoyu Li, Yong Zhang, Hongwen Zhang, Yebin Liu

We propose a novel 3D GAN framework for unsupervised learning of generative, high-quality and 3D-consistent facial avatars from unstructured 2D images.

Face Model

459

DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras

no code implementations • 16 Jul 2022 • Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu

At its core is a novel diffusion-based stereo module, which introduces diffusion models, a powerful type of generative model, into the iterative stereo matching network.

3D Human Reconstruction 4k +2

IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis

1 code implementation • 31 May 2022 • Jingxiang Sun, Xuan Wang, Yichun Shi, Lizhen Wang, Jue Wang, Yebin Liu

Existing 3D-aware facial generation methods face a dilemma in quality versus editability: they either generate editable results in low resolution or high-quality ones with no editing flexibility.

Ranked #1 on 3D-Aware Image Synthesis on FFHQ 512 x 512 - 4x upscaling

3D-Aware Image Synthesis

472

FENeRF: Face Editing in Neural Radiance Fields

1 code implementation • CVPR 2022 • Jingxiang Sun, Xuan Wang, Yong Zhang, Xiaoyu Li, Qi Zhang, Yebin Liu, Jue Wang

2D GANs can generate high fidelity portraits but with low view consistency.

3D-Aware Image Synthesis

223

BusTime: Which is the Right Prediction Model for My Bus Arrival Time?

no code implementations • 20 Mar 2020 • Dairui Liu, Jingxiang Sun, Shen Wang

With the rise of big data technologies, many smart transportation applications have been rapidly developed in recent years including bus arrival time predictions.
