no code implementations • 29 Aug 2024 • Qi Dong, Rubing Huang, Chenhui Cui, Dave Towey, Ling Zhou, Jinyu Tian, Jianzhou Wang
Short-Term Electricity-Load Forecasting (STELF) refers to the prediction of the immediate demand (in the next few hours to several days) for the power system.
no code implementations • CVPR 2024 • Kunyu Shi, Qi Dong, Luis Goncalves, Zhuowen Tu, Stefano Soatto
Sequence-to-sequence vision-language models are showing promise, but their applicability is limited by their inference latency due to their autoregressive way of generating predictions.
1 code implementation • 2 Jun 2023 • Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha
We propose DocFormerv2, a multi-modal transformer for Visual Document Understanding (VDU).
Ranked #9 on Visual Question Answering (VQA) on DocVQA test (using extra training data)
no code implementations • 13 Jul 2021 • Chaofan Zhang, Guoshan Zhang, Qi Dong
This paper proposes an adaptive dynamic programming-based adaptive-gain sliding mode control (ADP-ASMC) scheme for a fixed-wing unmanned aerial vehicle (UAV) with matched and unmatched disturbances.
no code implementations • ICCV 2021 • Qi Dong, Zhuowen Tu, Haofu Liao, Yuting Zhang, Vijay Mahadevan, Stefano Soatto
Computer vision applications such as visual relationship detection and human object interaction can be formulated as a composite (structured) set detection problem in which both the parts (subject, object, and predicate) and the sum (triplet as a whole) are to be detected in a hierarchical fashion.
no code implementations • ICCV 2019 • Qi Dong, Shaogang Gong, Xiatian Zhu
Existing person search methods predominantly assume the availability of at least one-shot imagery sample of the queried person.
1 code implementation • 25 Apr 2019 • Jiabo Huang, Qi Dong, Shaogang Gong, Xiatian Zhu
Deep convolutional neural networks (CNNs) have demonstrated remarkable success in computer vision by supervisedly learning strong visual feature representations.
no code implementations • 20 Nov 2018 • Qi Dong, Xiatian Zhu, Shaogang Gong
The objective learning formulation is essential for the success of convolutional neural networks.
1 code implementation • 28 Apr 2018 • Qi Dong, Shaogang Gong, Xiatian Zhu
In particular, existing deep learning methods consider mostly either class balanced data or moderately imbalanced data in model training, and ignore the challenge of learning from significantly imbalanced training data.
1 code implementation • ICCV 2017 • Qi Dong, Shaogang Gong, Xiatian Zhu
Recognising detailed facial or clothing attributes in images of people is a challenging task for computer vision, especially when the training data are both in very large scale and extremely imbalanced among different attribute classes.
no code implementations • 12 Oct 2016 • Qi Dong, Shaogang Gong, Xiatian Zhu
Recognising detailed clothing characteristics (fine-grained attributes) in unconstrained images of people in-the-wild is a challenging task for computer vision, especially when there is only limited training data from the wild whilst most data available for model learning are captured in well-controlled environments using fashion models (well lit, no background clutter, frontal view, high-resolution).