no code implementations • 11 Dec 2024 • Jiarui Zhang, Ollie Liu, Tianyu Yu, Jinyi Hu, Willie Neiswanger
For instance, Euclid outperforms the best closed-source model, Gemini-1. 5-Pro, by up to 58. 56% on certain Geoperception benchmark tasks and 10. 65% on average across all tasks.
no code implementations • 19 Sep 2024 • Jiarui Zhang
In modern commercial systems, including Recommendation, Ranking, and E-Commerce platforms, there is a trend towards improving customer experiences by incorporating Personalization context as input into Large Language Models (LLMs).
no code implementations • 30 Aug 2024 • Zhen Fan, Peng Dai, Zhuo Su, Xu Gao, Zheng Lv, Jiarui Zhang, Tianyuan Du, Guidong Wang, Yang Zhang
Specifically, EMHI provides synchronized stereo images from downward-sloping cameras on the headset and IMU data from body-worn sensors, along with pose annotations in SMPL format.
no code implementations • 11 Jul 2024 • Jiarui Zhang, Aijing Kong, Yu Tang, Zhichao Lv, Lulu Guo, Peng Hang
With the development of autonomous driving technology, there are increasing demands for vehicle control, and MPC has become a widely researched topic in both industry and academia.
no code implementations • 16 May 2024 • Jiarui Zhang, Shaojuan Wu, Xiaowang Zhang, Zhiyong Feng
Then, based on masked language model prediction, we present a target-aware relative stance sample generation method for obtaining relative bias.
1 code implementation • 21 Apr 2024 • Yifan Jiang, Jiarui Zhang, Kexuan Sun, Zhivar Sourati, Kian Ahrabian, Kaixin Ma, Filip Ilievski, Jay Pujara
Further analysis of perception questions reveals that MLLMs struggle to comprehend the visual features (near-random performance) and even count the panels in the puzzle ( <45%), hindering their ability for abstract reasoning.
1 code implementation • 12 Feb 2024 • Jiarui Zhang, Jinyi Hu, Mahyar Khayatkhoei, Filip Ilievski, Maosong Sun
Multimodal Large Language Models (MLLMs) have recently shown remarkable perceptual capability in answering visual questions, however, little is known about the limits of their perception.
no code implementations • 22 Jan 2024 • Kian Ahrabian, Zhivar Sourati, Kexuan Sun, Jiarui Zhang, Yifan Jiang, Fred Morstatter, Jay Pujara
While large language models (LLMs) are still being adopted to new domains and utilized in novel applications, we are experiencing an influx of the new generation of foundation models, namely multi-modal large language models (MLLMs).
1 code implementation • 26 Dec 2023 • Jiarui Zhang, Ruixu Geng, Xiaolong Du, Yan Chen, Houqiang Li, Yang Hu
In this work, we propose NLOS-LTM, a novel passive NLOS imaging method that effectively handles multiple light transport conditions with a single network.
no code implementations • 13 Dec 2023 • Jinta Weng, Jiarui Zhang, Yue Hu, Daidong Fa, Xiaofeng Xuand, Heyan Huang
In interaction with large language models, embedding more task-related information into prompts will make it easier to stimulate knowledge embedded in large language models.
2 code implementations • 24 Oct 2023 • Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski
In particular, we show that their zero-shot accuracy in answering visual questions is very sensitive to the size of the visual subject of the question, declining up to 46% with size.
no code implementations • 18 Sep 2023 • Tianyu Liu, Steven Ding, Jiarui Zhang, Liutao Zhou
This paper proposed a novel PINN-based viscosity solution for HJB equations.
no code implementations • 10 Jun 2023 • Shuaida He, Jiarui Zhang, Xin Chen
Sliced inverse regression (SIR), which includes linear discriminant analysis (LDA) as a special case, is a popular and powerful dimension reduction tool.
1 code implementation • 5 Jun 2023 • Jiarui Zhang, Filip Ilievski, Kaixin Ma, Aravinda Kollaa, Jonathan Francis, Alessandro Oltramari
Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure.
no code implementations • 31 May 2023 • Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski
As our initial analysis of BLIP-family models revealed difficulty with answering fine-detail questions, we investigate the following question: Can visual cropping be employed to improve the performance of state-of-the-art visual question answering models on fine-detail questions?
no code implementations • 8 May 2023 • Prateek Chhikara, Jiarui Zhang, Filip Ilievski, Jonathan Francis, Kaixin Ma
We experiment with four models on the 10 tasks in the ScienceWorld text-based game environment, to illustrate the impact of knowledge injection on various model configurations and challenging task settings.
2 code implementations • ICCV 2023 • Baixin Xu, Jiarui Zhang, Kwan-Yee Lin, Chen Qian, Ying He
To address this, we propose geometry decomposition and adopt a two-stage, coarse-to-fine training strategy, allowing for progressively capturing high-frequency geometric details.
1 code implementation • CVPR 2023 • Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu
Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of large-scale realscanned 3D databases.
1 code implementation • 4 Dec 2022 • Jiarui Zhang, Filip Ilievski, Aravinda Kollaa, Jonathan Francis, Kaixin Ma, Alessandro Oltramari
Understanding novel situations in the traffic domain requires an intricate combination of domain-specific and causal commonsense knowledge.
no code implementations • journal 2022 • Jiarui Zhang, Yukun Cheng, Xiaotie Deng
First, we modify the verification strategy so that nodes set a probability of verifying a received transaction considering the likelihood of it being spam: transactions from a node with a low reputation have a high probability of being verified.
no code implementations • 21 May 2022 • Jiarui Zhang, Filip Ilievski, Kaixin Ma, Jonathan Francis, Alessandro Oltramari
In this paper, we study the effect of knowledge sampling strategies and sizes that can be used to generate synthetic data for adapting language models.
no code implementations • 12 Mar 2022 • Yingjie Chen, Jiarui Zhang, Tao Wang, Yun Liang
Facial action units (AUs) play an indispensable role in human emotion analysis.
1 code implementation • 3 Jun 2021 • Wenhao Li, Fanchao Qi, Maosong Sun, Xiaoyuan Yi, Jiarui Zhang
We hope this dataset can further enhance the study on incorporating deep semantics into the understanding and generation system of Chinese classical poetry.
no code implementations • 11 May 2021 • Lixin Xu, Lin Li, Kunniang Liu, Jiarui Zhang, Yuanning Chang, Yunpeng Fang, Hao Yuan, Zhiyuan Yang, Jingyuan Chen, Yiyao Wang, Yajun Fang
ransportation systems have revolutionized the form of society.
no code implementations • WS 2020 • Junxuan Chen, Xiang Li, Jiarui Zhang, Chulun Zhou, Jianwei Cui, Bin Wang, Jinsong Su
Finally, we combine the discourse structure information with the word embedding before it is fed into the encoder.
no code implementations • 10 Mar 2019 • Haofu Liao, Wei-An Lin, Jiarui Zhang, Jingdan Zhang, Jiebo Luo, S. Kevin Zhou
As the POI tracker is shift-invariant, $\text{POINT}^2$ is more robust to the initial pose of the 3D pre-intervention image.