no code implementations • Findings (EMNLP) 2021 • Jiawei Wang, Hai Zhao, Yinggong Zhao, Libin Shen
Machine reading comprehension (MRC) is a challenging NLP task for it requires to carefully deal with all linguistic granularities from word, sentence to passage.
Chinese Reading Comprehension
Machine Reading Comprehension
+1
1 code implementation • COLING 2022 • Jiawei Wang, Hai Zhao
ArT is totally unsupervised and KBs-free.
1 code implementation • 23 Jan 2025 • Jiawei Wang, Qinglai Guo, Hongbin Sun
It integrates the heterogeneous energy and transportation infrastructures in the form of a network flow model to simulate and quantify the impact of extreme weather events on the energy-transportation infrastructure system.
2 code implementations • 22 Jan 2025 • DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu, Shirong Ma, Peiyi Wang, Xiao Bi, Xiaokang Zhang, Xingkai Yu, Yu Wu, Z. F. Wu, Zhibin Gou, Zhihong Shao, Zhuoshu Li, Ziyi Gao, Aixin Liu, Bing Xue, Bingxuan Wang, Bochao Wu, Bei Feng, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Qu, Hui Li, JianZhong Guo, Jiashi Li, Jiawei Wang, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, J. L. Cai, Jiaqi Ni, Jian Liang, Jin Chen, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Liang Zhao, Litong Wang, Liyue Zhang, Lei Xu, Leyi Xia, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Meng Li, Miaojun Wang, Mingming Li, Ning Tian, Panpan Huang, Peng Zhang, Qiancheng Wang, Qinyu Chen, Qiushi Du, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, R. J. Chen, R. L. Jin, Ruyi Chen, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shiyu Wang, Shuiping Yu, Shunfeng Zhou, Shuting Pan, S. S. Li, Shuang Zhou, Shaoqing Wu, Shengfeng Ye, Tao Yun, Tian Pei, Tianyu Sun, T. Wang, Wangding Zeng, Wanjia Zhao, Wen Liu, Wenfeng Liang, Wenjun Gao, Wenqin Yu, Wentao Zhang, W. L. Xiao, Wei An, Xiaodong Liu, Xiaohan Wang, Xiaokang Chen, Xiaotao Nie, Xin Cheng, Xin Liu, Xin Xie, Xingchao Liu, Xinyu Yang, Xinyuan Li, Xuecheng Su, Xuheng Lin, X. Q. Li, Xiangyue Jin, Xiaojin Shen, Xiaosha Chen, Xiaowen Sun, Xiaoxiang Wang, Xinnan Song, Xinyi Zhou, Xianzu Wang, Xinxia Shan, Y. K. Li, Y. Q. Wang, Y. X. Wei, Yang Zhang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Wang, Yi Yu, Yichao Zhang, Yifan Shi, Yiliang Xiong, Ying He, Yishi Piao, Yisong Wang, Yixuan Tan, Yiyang Ma, Yiyuan Liu, Yongqiang Guo, Yuan Ou, Yuduan Wang, Yue Gong, Yuheng Zou, Yujia He, Yunfan Xiong, Yuxiang Luo, Yuxiang You, Yuxuan Liu, Yuyang Zhou, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yaohui Li, Yi Zheng, Yuchen Zhu, Yunxian Ma, Ying Tang, Yukun Zha, Yuting Yan, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhean Xu, Zhenda Xie, Zhengyan Zhang, Zhewen Hao, Zhicheng Ma, Zhigang Yan, Zhiyu Wu, Zihui Gu, Zijia Zhu, Zijun Liu, Zilin Li, Ziwei Xie, Ziyang Song, Zizheng Pan, Zhen Huang, Zhipeng Xu, Zhongyu Zhang, Zhen Zhang
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Ranked #1 on
Mathematical Reasoning
on AIME24
1 code implementation • 14 Jan 2025 • Liping Yuan, Jiawei Wang, Haomiao Sun, Yuchen Zhang, Yuan Lin
We introduce Tarsier2, a state-of-the-art large vision-language model (LVLM) designed for generating detailed and accurate video descriptions, while also exhibiting superior general video understanding capabilities.
Ranked #2 on
Video Question Answering
on TVBench
no code implementations • 10 Jan 2025 • Zheqi Lv, Wenkai Wang, Jiawei Wang, Shengyu Zhang, Fei Wu
SEAT uses more powerful EMLLMs for CoT reasoning, data selection, and evaluation generation, then trains EMLLMs with the synthesized data.
1 code implementation • 27 Dec 2024 • DeepSeek-AI, Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, JianZhong Guo, Jiaqi Ni, Jiashi Li, Jiawei Wang, Jin Chen, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, Junxiao Song, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Litong Wang, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qiancheng Wang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, Runxin Xu, Ruoyu Zhang, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Shuting Pan, T. Wang, Tao Yun, Tian Pei, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wanjia Zhao, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wenqin Yu, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaokang Zhang, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Cheng, Xin Liu, Xin Xie, Xingchao Liu, Xingkai Yu, Xinnan Song, Xinxia Shan, Xinyi Zhou, Xinyu Yang, Xinyuan Li, Xuecheng Su, Xuheng Lin, Y. K. Li, Y. Q. Wang, Y. X. Wei, Y. X. Zhu, Yang Zhang, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Yu, Yi Zheng, Yichao Zhang, Yifan Shi, Yiliang Xiong, Ying He, Ying Tang, Yishi Piao, Yisong Wang, Yixuan Tan, Yiyang Ma, Yiyuan Liu, Yongqiang Guo, Yu Wu, Yuan Ou, Yuchen Zhu, Yuduan Wang, Yue Gong, Yuheng Zou, Yujia He, Yukun Zha, Yunfan Xiong, Yunxian Ma, Yuting Yan, Yuxiang Luo, Yuxiang You, Yuxuan Liu, Yuyang Zhou, Z. F. Wu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhean Xu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhengyan Zhang, Zhewen Hao, Zhibin Gou, Zhicheng Ma, Zhigang Yan, Zhihong Shao, Zhipeng Xu, Zhiyu Wu, Zhongyu Zhang, Zhuoshu Li, Zihui Gu, Zijia Zhu, Zijun Liu, Zilin Li, Ziwei Xie, Ziyang Song, Ziyi Gao, Zizheng Pan
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
1 code implementation • 13 Dec 2024 • Zhiyu Wu, Xiaokang Chen, Zizheng Pan, Xingchao Liu, Wen Liu, Damai Dai, Huazuo Gao, Yiyang Ma, Chengyue Wu, Bingxuan Wang, Zhenda Xie, Yu Wu, Kai Hu, Jiawei Wang, Yaofeng Sun, Yukun Li, Yishi Piao, Kang Guan, Aixin Liu, Xin Xie, Yuxiang You, Kai Dong, Xingkai Yu, Haowei Zhang, Liang Zhao, Yisong Wang, Chong Ruan
We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL, through two key major upgrades.
Ranked #1 on
Referring Expression Comprehension
on RefCOCOg-test
1 code implementation • 3 Dec 2024 • Xiaojie Yang, Hangli Ge, Jiawei Wang, Zipei Fan, Renhe Jiang, Ryosuke Shibasaki, Noboru Koshizuka
In this study, we propose a causality-augmented prediction model, called CausalMob, to analyze the causal effects of public events.
no code implementations • 25 Nov 2024 • Jiawei Wang, Zhiming Cui, Changjian Li
To overcome these limitations, we propose treating each stroke within a sketch as an entity and introducing a vector-quantized (VQ) stroke representation for fine-grained sketch generation.
no code implementations • 21 Nov 2024 • Shuai Li, Chaoyi Chen, Haotian Zheng, Jiawei Wang, Qing Xu, Jianqiang Wang, Keqiang Li
This leads to a robust data-driven predictive control framework, solved in a tube-based control manner.
no code implementations • 2 Jul 2024 • Jiawei Wang, Liping Yuan, Yuchen Zhang
In this work, we introduce Tarsier, a family of large-scale video-language models designed to generate high-quality video descriptions.
1 code implementation • arXiv 2024 • Jiawei Wang, Liping Yuan, Yuchen Zhang, Haomiao Sun
Our second contribution is the introduction of a new benchmark -- DREAM-1K (https://tarsier-vlm. github. io/) for evaluating video description models, consisting of a new challenging dataset featuring videos from diverse sources and varying complexity, along with an automatic method specifically designed to assess the quality of fine-grained video descriptions.
Ranked #1 on
Zero-Shot Video Question Answer
on NExT-QA
(using extra training data)
no code implementations • 20 May 2024 • Jiawei Wang, Kai Hu, Qiang Huo
Document layout analysis (DLA) is crucial for understanding the physical layout and logical structure of documents, serving information retrieval, document summarization, knowledge extraction, etc.
no code implementations • 16 May 2024 • Jing Yang, Xiao Wang, Yutong Wang, Jiawei Wang, Fei-Yue Wang
To achieve more accurate TKG reasoning, we propose an attention masking-based contrastive event network (AMCEN) with local-global temporal patterns for the two-stage prediction of future events.
no code implementations • 13 Mar 2024 • ZiQi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu
Recurrent neural networks have become a standard modeling technique for sequential data in TTS systems and are widely used.
2 code implementations • 22 Feb 2024 • Jiawei Wang, Renhe Jiang, Chuang Yang, Zengqing Wu, Makoto Onizuka, Ryosuke Shibasaki, Noboru Koshizuka, Chuan Xiao
This paper introduces a novel approach using Large Language Models (LLMs) integrated into an agent framework for flexible and effective personal mobility generation.
no code implementations • 6 Feb 2024 • Shuai Li, Chaoyi Chen, Haotian Zheng, Jiawei Wang, Qing Xu, Keqiang Li
Data-driven predictive control promises model-free wave-dampening strategies for Connected and Autonomous Vehicles (CAVs) in mixed traffic flow.
1 code implementation • 2 Feb 2024 • Jiawei Wang, Yuchen Zhang, Jiaxin Zou, Yan Zeng, Guoqiang Wei, Liping Yuan, Hang Li
Its robust motion controllability is validated by drastic increases in the bounding box alignment metric.
1 code implementation • 22 Jan 2024 • Jiawei Wang, Kai Hu, Zhuoyao Zhong, Lei Sun, Qiang Huo
Our end-to-end system achieves state-of-the-art performance on two large-scale document layout analysis datasets (PubLayNet and DocLayNet), a high-quality hierarchical document structure reconstruction dataset (HRDoc), and our Comp-HRDoc benchmark.
no code implementations • 17 Jan 2024 • Kai Hu, Jiawei Wang, WeiHong Lin, Zhuoyao Zhong, Lei Sun, Qiang Huo
This unified approach allows for the definition of various relation types and effectively tackles hierarchical relationships in form-like documents.
no code implementations • 17 Jan 2024 • Jiawei Wang, Shunchi Zhang, Kai Hu, Chixiang Ma, Zhuoyao Zhong, Lei Sun, Qiang Huo
Contextual Text Block Detection (CTBD) is the task of identifying coherent text blocks within the complexity of natural scenes.
1 code implementation • 25 Dec 2023 • Wenzhang Liu, Wenzhe Cai, Kun Jiang, Guangran Cheng, Yuanda Wang, Jiawei Wang, Jingyu Cao, Lele Xu, Chaoxu Mu, Changyin Sun
In this paper, we present XuanCe, a comprehensive and unified deep reinforcement learning (DRL) library designed to be compatible with PyTorch, TensorFlow, and MindSpore.
no code implementations • CVPR 2024 • Jiawei Wang, Changjian Li
Sketch semantic segmentation is a well-explored and pivotal problem in computer vision involving the assignment of pre-defined part labels to individual strokes.
no code implementations • 18 Sep 2023 • Jie Li, Jiawei Wang, Shengbo Eben Li, Keqiang Li
Connected and automated vehicles (CAVs) technologies promise to attenuate undesired traffic disturbances.
no code implementations • 4 Sep 2023 • Shuai Li, Haotian Zheng, Jiawei Wang, Chaoyi Chen, Qing Xu, Jianqiang Wang, Keqiang Li
In mixed traffic where human-driven vehicles (HDVs) also exist, existing research mostly focuses on "looking ahead" (i. e., the CAVs receive information from preceding vehicles) strategies for CAVs, while recent work reveals that "looking behind" (i. e., the CAVs receive information from their rear vehicles) strategies might provide more possibilities for CAV longitudinal control.
1 code implementation • 8 Jun 2023 • Shizhe Diao, Tianyang Xu, Ruijia Xu, Jiawei Wang, Tong Zhang
Pre-trained language models (PLMs) demonstrate excellent abilities to understand texts in the generic domain while struggling in a specific domain.
no code implementations • 23 Apr 2023 • Yiming Gao, Feiyu Liu, Liang Wang, Zhenjie Lian, Weixuan Wang, Siqin Li, Xianliang Wang, Xianhan Zeng, Rundong Wang, Jiawei Wang, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu
MOBA games, e. g., Dota2 and Honor of Kings, have been actively used as the testbed for the recent AI research on games, and various AI systems have been developed at the human level so far.
no code implementations • 21 Mar 2023 • Jiawei Wang, WeiHong Lin, Chixiang Ma, Mingze Li, Zheng Sun, Lei Sun, Qiang Huo
Unlike previous methods, we formulate table separation line prediction as a line regression problem instead of an image segmentation problem and propose a new two-stage dynamic queries enhanced DETR based separation line regression approach, named DQ-DETR, to predict separation lines from table images directly.
2 code implementations • 22 Nov 2022 • Yan Zeng, Xinsong Zhang, Hang Li, Jiawei Wang, Jipeng Zhang, Wangchunshu Zhou
Vision language pre-training aims to learn alignments between vision and language from a large amount of data.
Ranked #1 on
Cross-Modal Retrieval
on Flickr30k
(using extra training data)
1 code implementation • 24 Oct 2022 • Jiawei Wang, Yingzhao Lian, Yuning Jiang, Qing Xu, Keqiang Li, Colin N. Jones
This algorithm achieves both computation and communication efficiency, as well as trajectory data privacy, through parallel calculation.
no code implementations • 9 Aug 2022 • WeiHong Lin, Zheng Sun, Chixiang Ma, Mingze Li, Jiawei Wang, Lei Sun, Qiang Huo
We present a new table structure recognition (TSR) approach, called TSRFormer, to robustly recognizing the structures of complex tables with geometrical distortions from various table images.
Ranked #2 on
Table Recognition
on PubTabNet
(TEDS-Struct metric)
1 code implementation • 15 Jun 2022 • Shizhe Diao, Wangchunshu Zhou, Xinsong Zhang, Jiawei Wang
In this work, we disclose the potential of symmetric generative vision-language pre-training in learning to write and paint concurrently, and propose a new unified modal model, named DaVinci, trained with prefix language modeling and prefix image modeling, a simple generative self-supervised objective on image-text pairs.
no code implementations • 8 May 2022 • Jiawei Wang, Ruiyang Zhou, Fengying Wei
The role of psychological effects makes the contact between adult and environmental toxins a functional form, while the contact between larvae and environmental toxins is linear.
no code implementations • Findings (NAACL) 2022 • Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu
Specifically, we first extract multiple code views using compiler tools, and learn the complementary information among them under a contrastive learning framework.
1 code implementation • 7 Apr 2022 • Jiawei Wang, Yang Zheng, Jianghong Dong, Chaoyi Chen, Mengchi Cai, Keqiang Li, Qing Xu
In this paper, we present the first experimental results of data-driven predictive control for connected and autonomous vehicles (CAVs) in dissipating traffic waves.
1 code implementation • 20 Mar 2022 • Jiawei Wang, Yang Zheng, Keqiang Li, Qing Xu
For the control of connected and autonomous vehicles (CAVs), most existing methods focus on model-based strategies.
1 code implementation • 26 Dec 2021 • Jiawei Wang, Hai Zhao
In detail, our model first focuses on key parts in the given context, and then generates highly related knowledge on such a basis in an association way like human thinking.
no code implementations • 1 Dec 2021 • Mengchi Cai, Qing Xu, Chunying Yang, Jianghong Dong, Chaoyi Chen, Jiawei Wang, Jianqiang Wang, Keqiang Li
Formation control methods of connected and automated vehicles have been proposed to smoothly switch the structure of vehicular formations in different scenarios.
no code implementations • 2 Nov 2021 • Jiawei Wang, Lijun Sun
However, the operation of a bus fleet is unstable in nature, and bus bunching has become a common phenomenon that undermines the efficiency and reliability of bus systems.
no code implementations • 19 Oct 2021 • Jiawei Wang, Yang Zheng, Qing Xu, Keqiang Li
In this paper, instead of relying on a parametric car-following model, we introduce a data-driven predictive control strategy to achieve safe and optimal control for CAVs in mixed traffic.
no code implementations • 29 Sep 2021 • Jiawei Wang, Konghuai Shen, Shao Ming, Jun Yin, Ming Liu
In recent years, a great progress has been witnessed for cross-domain object detection.
1 code implementation • Findings (EMNLP) 2021 • Shifeng Huang, Jiawei Wang, Jiao Xu, Da Cao, Ming Yang
Specifically, given a math word problem, the model first retrieves similar questions by a memory module and then encodes the unsolved problem and each retrieved question using a representation module.
Ranked #8 on
Math Word Problem Solving
on Math23K
3 code implementations • 25 Aug 2021 • Mengchi Cai, Qing Xu, Chaoyi Chen, Jiawei Wang, Keqiang Li, Jianqiang Wang, Xiangbin Wu
Unsignalized intersection cooperation of connected and automated vehicles (CAVs) is able to eliminate green time loss of signalized intersections and improve traffic efficiency.
no code implementations • 2 May 2021 • Jiawei Wang, Lijun Sun
However, due to the significant uncertainties in passenger demand and traffic conditions, bus operation is unstable in nature and bus bunching has become a common phenomenon that undermines the reliability and efficiency of bus services.
4 code implementations • 18 Mar 2021 • Mengchi Cai, Qing Xu, Chaoyi Chen, Jiawei Wang, Keqiang Li, Jianqiang Wang, Qianying Zhu
Multi-vehicle coordinated decision making and control can improve traffic efficiency while guaranteeing driving safety.
1 code implementation • 4 Mar 2021 • Jiawei Wang, Li Li, Andreas Zeller
More than ninety percent of published Jupyter notebooks do not state dependencies on external packages.
Software Engineering
1 code implementation • 8 Dec 2020 • Jiawei Wang, Yang Zheng, Chaoyi Chen, Qing Xu, Keqiang Li
Most existing strategies for CAVs' longitudinal control focus on downstream traffic conditions, but neglect the impact of CAVs' behaviors on upstream traffic flow.
no code implementations • 21 Sep 2020 • Brandon Paulsen, Jingbo Wang, Jiawei Wang, Chao Wang
Unfortunately, existing methods either focus on verifying a single network or rely on loose approximations to prove the equivalence of two networks.
no code implementations • 8 Sep 2020 • Keqiang Li, Jiawei Wang, Yang Zheng
In mixed traffic flow consisting of AVs and human-driven vehicles (HDVs), the prevailing platooning of multiple AVs is not the only choice for cooperative formation.
1 code implementation • 23 Jul 2020 • Jiawei Wang, Yang Zheng, Chaoyi Chen, Qing Xu, Keqiang Li
Numerical studies confirm the potential of LCC to strengthen the capability of CAVs in suppressing traffic instabilities and smoothing traffic flow.
Systems and Control Systems and Control Optimization and Control
no code implementations • 6 Jul 2020 • Tianyu Shi, Jiawei Wang, Yuankai Wu, Luis Miranda-Moreno, Lijun Sun
Instead of learning a reliable behavior for ego automated vehicle, we focus on how to improve the outcomes of the total transportation system by allowing each automated vehicle to learn cooperation with each other and regulate human-driven traffic flow.
no code implementations • 10 Oct 2018 • Jiawei Wang, Zhaoshui He, Chengjian Feng, Zhouping Zhu, Qinzhuang Lin, Jun Lv, Shengli Xie
Data collection and annotation are time-consuming in machine learning, expecially for large scale problem.
no code implementations • 29 Oct 2013 • Jimmy SJ. Ren, Wei Wang, Jiawei Wang, Stephen Liao
Sophisticated automatic incident detection (AID) technology plays a key role in contemporary transportation systems.
no code implementations • 3 Aug 2013 • Jimmy SJ. Ren, Wei Wang, Jiawei Wang, Stephen Shaoyi Liao
We argue that if the bias-variance trade-off is to be better balanced by a more effective feature selection method unlabeled data is very likely to boost the classification performance.