no code implementations • 29 Mar 2024 • Zelin Zhao, Fenglei Fan, Wenlong Liao, Junchi Yan
Many contemporary studies utilize grid-based models for neural field representation, but a systematic analysis of grid-based models is still missing, hindering the improvement of those models.
no code implementations • 10 Oct 2023 • Peng Di, Jianguo Li, Hang Yu, Wei Jiang, Wenting Cai, Yang Cao, Chaoyu Chen, Dajun Chen, Hongwei Chen, Liang Chen, Gang Fan, Jie Gong, Zi Gong, Wen Hu, Tingting Guo, Zhichao Lei, Ting Li, Zheng Li, Ming Liang, Cong Liao, Bingchang Liu, Jiachen Liu, Zhiwei Liu, Shaojun Lu, Min Shen, Guangpei Wang, Huan Wang, Zhi Wang, Zhaogui Xu, Jiawei Yang, Qing Ye, Gehao Zhang, Yu Zhang, Zelin Zhao, Xunjin Zheng, Hailian Zhou, Lifu Zhu, Xianying Zhu
It is specifically designed for code-related tasks with both English and Chinese prompts and supports over 40 programming languages.
no code implementations • 29 Jul 2022 • Zelin Zhao, Jiaya Jia
On the one hand, NeRFA considers the volumetric rendering equation as a soft feature modulation procedure.
1 code implementation • 12 Jul 2022 • Zelin Zhao, Ze Wu, Yueqing Zhuang, Boxun Li, Jiaya Jia
During inference, a pixel-wise association procedure is proposed to recover object connections through frames based on the pixel-wise prediction.
no code implementations • 11 Feb 2022 • Karan Samel, Zelin Zhao, Binghong Chen, Shuang Li, Dharmashankar Subramanian, Irfan Essa, Le Song
Events across a timeline are a common data representation, seen in different temporal modalities.
2 code implementations • 12 Jan 2022 • Xudong Tang, Leonardo Zepeda-Nunez, Shengwen Yang, Zelin Zhao, Claudia Solis-Lemus
Scientists worldwide are putting together massive efforts to understand how the biodiversity that we see on Earth evolved from single-cell organisms at the origin of life, and this diversification process is represented through the Tree of Life.
1 code implementation • NeurIPS 2021 • Zelin Zhao, Karan Samel, Binghong Chen, Le Song
Furthermore, we propose the Program-guided Transformer (ProTo), which integrates both semantic and structural guidance of a program by leveraging cross-attention and masked self-attention to pass messages between the specification and routines in the program.
Ranked #1 on Visual Question Answering (VQA) on GQA test-std
no code implementations • 29 Sep 2021 • Karan Samel, Zelin Zhao, Binghong Chen, Shuang Li, Dharmashankar Subramanian, Irfan Essa, Le Song
Events across a timeline are a common data representation, seen in different temporal modalities.
no code implementations • 22 Mar 2021 • Karan Samel, Zelin Zhao, Binghong Chen, Kuan Wang, Robin Luo, Le Song
In multi-modal reasoning tasks, such as visual question answering (VQA), many modeling and training paradigms have been tested.
no code implementations • 1 Jan 2021 • Karan Samel, Zelin Zhao, Kuan Wang, Robin Luo, Binghong Chen, Le Song
We present a differentiable end-to-end program executor (DePe), which addresses Visual Question Answering (VQA) in a sample-efficient and computationally efficient manner.
2 code implementations • 23 Dec 2020 • Zelin Zhao, Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua B. Tenenbaum
Humans can abstract prior knowledge from very little data and use it to boost skill learning.
6 code implementations • 4 Dec 2018 • Zelin Zhao, Gao Peng, Haoyu Wang, Hao-Shu Fang, Chengkun Li, Cewu Lu
In this paper, we present an accurate yet effective solution for 6D pose estimation from an RGB image.
Ranked #17 on 6D Pose Estimation using RGB on LineMOD
4 code implementations • 2 Jul 2018 • Mingyang Jiang, Yiran Wu, Tianqi Zhao, Zelin Zhao, Cewu Lu
Recently, 3D understanding research has shed light on extracting features directly from point clouds, which requires an effective description of their shape patterns.