Search Results for author: Jingwen Wang

Found 30 papers, 17 papers with code

MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

1 code implementation1 Dec 2023 Hengyi Wang, Jingwen Wang, Lourdes Agapito

Thanks to the expressiveness of neural representations, prior works can accurately capture the motion and achieve high-fidelity reconstruction of the target object.

Neural Rendering Surface Reconstruction

SeMLaPS: Real-time Semantic Mapping with Latent Prior Networks and Quasi-Planar Segmentation

no code implementations28 Jun 2023 Jingwen Wang, Juan Tarrio, Lourdes Agapito, Pablo F. Alcantarilla, Alexander Vakhitov

We present a new methodology for real-time semantic mapping from RGB-D sequences that combines a 2D neural network and a 3D network based on a SLAM system with 3D occupancy mapping.

Image Segmentation Semantic Segmentation

First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

1 code implementation23 Jun 2023 Tom Tongjia Chen, Hongshan Yu, Zhengeng Yang, Ming Li, Zechuan Li, Jingwen Wang, Wei Miao, Wei Sun, Chen Chen

Affordance-Centric Question-driven Task Completion (AQTC) has been proposed to acquire knowledge from videos to furnish users with comprehensive and systematic instructions.

Human-Object Interaction Detection

Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM

1 code implementation CVPR 2023 Hengyi Wang, Jingwen Wang, Lourdes Agapito

We present Co-SLAM, a neural RGB-D SLAM system based on a hybrid representation, that performs robust camera tracking and high-fidelity surface reconstruction in real time.

Surface Reconstruction

MalIoT: Scalable and Real-time Malware Traffic Detection for IoT Networks

no code implementations2 Apr 2023 Ethan Weitkamp, Yusuke Satani, Adam Omundsen, Jingwen Wang, Peilong Li

The machine learning approach is vital in Internet of Things (IoT) malware traffic detection due to its ability to keep pace with the ever-evolving nature of malware.

FCC: Feature Clusters Compression for Long-Tailed Visual Recognition

1 code implementation CVPR 2023 Jian Li, Ziyao Meng, Daqian Shi, Rui Song, Xiaolei Diao, Jingwen Wang, Hao Xu

Through representation learning, DNNs can map BFs into dense clusters in feature space, while the features of minority classes often show sparse clusters.

Representation Learning

CTT-Net: A Multi-view Cross-token Transformer for Cataract Postoperative Visual Acuity Prediction

1 code implementation12 Dec 2022 Jinhong Wang, Jingwen Wang, Tingting Chen, Wenhao Zheng, Zhe Xu, Xingdi Wu, Wen Xu, Haochao Ying, Danny Chen, Jian Wu

Clinically, to assess the necessity of cataract surgery, accurately predicting postoperative VA before surgery by analyzing multi-view optical coherence tomography (OCT) images is crucially needed.

regression

Visual Subtitle Feature Enhanced Video Outline Generation

no code implementations24 Aug 2022 Qi Lv, Ziqiang Cao, Wenrui Xie, Derui Wang, Jingwen Wang, Zhiwei Hu, Tangkun Zhang, Ba Yuan, Yuanhang Li, Min Cao, Wenjie Li, Sujian Li, Guohong Fu

Furthermore, based on the similarity between video outlines and textual outlines, we use a large number of articles with chapter headings to pretrain our model.

Headline Generation Navigate +4

GO-Surf: Neural Feature Grid Optimization for Fast, High-Fidelity RGB-D Surface Reconstruction

1 code implementation29 Jun 2022 Jingwen Wang, Tymoteusz Bleja, Lourdes Agapito

We present GO-Surf, a direct feature grid optimization method for accurate and fast surface reconstruction from RGB-D sequences.

Surface Reconstruction

Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT Scans

1 code implementation7 Jun 2022 Jiansheng Fang, Jingwen Wang, Anwei Li, Yuguang Yan, Yonghe Hou, Chao Song, Hongbo Liu, Jiang Liu

In the management of lung nodules, we are desirable to predict nodule evolution in terms of its diameter variation on Computed Tomography (CT) scans and then provide a follow-up recommendation according to the predicted result of the growing trend of the nodule.

Computed Tomography (CT) Management

Controllable Video Captioning with an Exemplar Sentence

1 code implementation2 Dec 2021 Yitian Yuan, Lin Ma, Jingwen Wang, Wenwu Zhu

In this paper, we investigate a novel and challenging task, namely controllable video captioning with an exemplar sentence.

Caption Generation Sentence +2

DSP-SLAM: Object Oriented SLAM with Deep Shape Priors

1 code implementation21 Aug 2021 Jingwen Wang, Martin Rünz, Lourdes Agapito

We propose DSP-SLAM, an object-oriented SLAM system that builds a rich and accurate joint map of dense 3D models for foreground objects, and sparse landmark points to represent the background.

3D Object Reconstruction Object +2

Recurrent Exposure Generation for Low-Light Face Detection

1 code implementation21 Jul 2020 Jinxiu Liang, Jingwen Wang, Yuhui Quan, Tianyi Chen, Jiaying Liu, Haibin Ling, Yong Xu

REG produces progressively and efficiently intermediate images corresponding to various exposure settings, and such pseudo-exposures are then fused by MED to detect faces across different lighting conditions.

Face Detection Image Enhancement

Deep Bilateral Retinex for Low-Light Image Enhancement

no code implementations4 Jul 2020 Jinxiu Liang, Yong Xu, Yuhui Quan, Jingwen Wang, Haibin Ling, Hui Ji

Low-light images, i. e. the images captured in low-light conditions, suffer from very poor visibility caused by low contrast, color distortion and significant measurement noise.

Low-Light Image Enhancement

Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos

no code implementations16 Mar 2020 Yijun Song, Jingwen Wang, Lin Ma, Zhou Yu, Jun Yu

The task of temporally grounding textual queries in videos is to localize one video segment that semantically corresponds to the given query.

Sentence

Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis

1 code implementation18 Dec 2019 Richard J. Chen, Ming Y. Lu, Jingwen Wang, Drew F. K. Williamson, Scott J. Rodig, Neal I. Lindeman, Faisal Mahmood

Cancer diagnosis, prognosis, and therapeutic response predictions are based on morphological information from histology slides and molecular profiles from genomic data.

Feature Importance

Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos

1 code implementation NeurIPS 2019 Yitian Yuan, Lin Ma, Jingwen Wang, Wei Liu, Wenwu Zhu

Temporal sentence grounding in videos aims to detect and localize one target video segment, which semantically corresponds to a given sentence.

Sentence Temporal Sentence Grounding

Weakly Supervised Prostate TMA Classification via Graph Convolutional Networks

no code implementations29 Oct 2019 Jingwen Wang, Richard J. Chen, Ming Y. Lu, Alexander Baras, Faisal Mahmood

In prostate cancer, the Gleason score is a grading system used to measure the aggressiveness of prostate cancer from the spatial organization of cells and the distribution of glands.

Classification General Classification

Semi-Supervised Histology Classification using Deep Multiple Instance Learning and Contrastive Predictive Coding

no code implementations23 Oct 2019 Ming Y. Lu, Richard J. Chen, Jingwen Wang, Debora Dillon, Faisal Mahmood

Convolutional neural networks can be trained to perform histology slide classification using weak annotations with multiple instance learning (MIL).

Binary Classification Classification +4

Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction

1 code implementation11 Sep 2019 Jingwen Wang, Lin Ma, Wenhao Jiang

The task of temporally grounding language queries in videos is to temporally localize the best matched video segment corresponding to a given language (sentence).

Sentence

Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network

1 code implementation ICCV 2019 Bairui Wang, Lin Ma, Wei zhang, Wenhao Jiang, Jingwen Wang, Wei Liu

In this paper, we propose to guide the video caption generation with Part-of-Speech (POS) information, based on a gated fusion of multiple representations of input videos.

Caption Generation POS +2

Generating an Overview Report over Many Documents

no code implementations17 Aug 2019 Jingwen Wang, Hao Zhang, Cheng Zhang, Wenjing Yang, Liqun Shao, Jie Wang

To overcome this obstacle, we present NDORGS (Numerous Documents' Overview Report Generation Scheme) that integrates text filtering, keyword scoring, single-document summarization (SDS), topic modeling, MDS, and title generation to generate a coherent, well-structured ORPT.

Attribute Decision Making +2

An anomaly prediction framework for financial IT systems using hybrid machine learning methods

no code implementations30 Jul 2019 Jingwen Wang, Jingxin Liu, Juntao Pu, Qinghong Yang, Zhongchen Miao, Jian Gao, You Song

To improve the efficiency and accuracy of system failure detection and thereby reduce the impact of system failures on financial services, we propose a novel machine learning-based framework to predict the occurrence of system exceptions and failures in a financial software system.

BIG-bench Machine Learning Time Series Prediction

Non-local NetVLAD Encoding for Video Classification

no code implementations29 Sep 2018 Yongyi Tang, Xing Zhang, Jingwen Wang, Shaoxiang Chen, Lin Ma, Yu-Gang Jiang

This paper describes our solution for the 2$^\text{nd}$ YouTube-8M video understanding challenge organized by Google AI.

Classification General Classification +3

Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning

1 code implementation CVPR 2018 Jingwen Wang, Wenhao Jiang, Lin Ma, Wei Liu, Yong Xu

We propose a bidirectional proposal method that effectively exploits both past and future contexts to make proposal predictions.

Dense Video Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.