Search Results for author: Jingwen Wang

Found 30 papers, 17 papers with code

MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

1 code implementation • 1 Dec 2023 • Hengyi Wang, Jingwen Wang, Lourdes Agapito

Thanks to the expressiveness of neural representations, prior works can accurately capture the motion and achieve high-fidelity reconstruction of the target object.

Neural Rendering Surface Reconstruction

Paper
Code

SeMLaPS: Real-time Semantic Mapping with Latent Prior Networks and Quasi-Planar Segmentation

no code implementations • 28 Jun 2023 • Jingwen Wang, Juan Tarrio, Lourdes Agapito, Pablo F. Alcantarilla, Alexander Vakhitov

We present a new methodology for real-time semantic mapping from RGB-D sequences that combines a 2D neural network and a 3D network based on a SLAM system with 3D occupancy mapping.

Image Segmentation Semantic Segmentation

Paper
Add Code

First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

1 code implementation • 23 Jun 2023 • Tom Tongjia Chen, Hongshan Yu, Zhengeng Yang, Ming Li, Zechuan Li, Jingwen Wang, Wei Miao, Wei Sun, Chen Chen

Affordance-Centric Question-driven Task Completion (AQTC) has been proposed to acquire knowledge from videos to furnish users with comprehensive and systematic instructions.

Human-Object Interaction Detection

Paper
Code

Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM

1 code implementation • CVPR 2023 • Hengyi Wang, Jingwen Wang, Lourdes Agapito

We present Co-SLAM, a neural RGB-D SLAM system based on a hybrid representation, that performs robust camera tracking and high-fidelity surface reconstruction in real time.

Surface Reconstruction

369

Paper
Code

MalIoT: Scalable and Real-time Malware Traffic Detection for IoT Networks

no code implementations • 2 Apr 2023 • Ethan Weitkamp, Yusuke Satani, Adam Omundsen, Jingwen Wang, Peilong Li

The machine learning approach is vital in Internet of Things (IoT) malware traffic detection due to its ability to keep pace with the ever-evolving nature of malware.

Paper
Add Code

FCC: Feature Clusters Compression for Long-Tailed Visual Recognition

1 code implementation • CVPR 2023 • Jian Li, Ziyao Meng, Daqian Shi, Rui Song, Xiaolei Diao, Jingwen Wang, Hao Xu

Through representation learning, DNNs can map BFs into dense clusters in feature space, while the features of minority classes often show sparse clusters.

Representation Learning

Paper
Code

CTT-Net: A Multi-view Cross-token Transformer for Cataract Postoperative Visual Acuity Prediction

1 code implementation • 12 Dec 2022 • Jinhong Wang, Jingwen Wang, Tingting Chen, Wenhao Zheng, Zhe Xu, Xingdi Wu, Wen Xu, Haochao Ying, Danny Chen, Jian Wu

Clinically, to assess the necessity of cataract surgery, accurately predicting postoperative VA before surgery by analyzing multi-view optical coherence tomography (OCT) images is crucially needed.

regression

Paper
Code

Visual Subtitle Feature Enhanced Video Outline Generation

no code implementations • 24 Aug 2022 • Qi Lv, Ziqiang Cao, Wenrui Xie, Derui Wang, Jingwen Wang, Zhiwei Hu, Tangkun Zhang, Ba Yuan, Yuanhang Li, Min Cao, Wenjie Li, Sujian Li, Guohong Fu

Furthermore, based on the similarity between video outlines and textual outlines, we use a large number of articles with chapter headings to pretrain our model.

Headline Generation Navigate +4

Paper
Add Code

GO-Surf: Neural Feature Grid Optimization for Fast, High-Fidelity RGB-D Surface Reconstruction

1 code implementation • 29 Jun 2022 • Jingwen Wang, Tymoteusz Bleja, Lourdes Agapito

We present GO-Surf, a direct feature grid optimization method for accurate and fast surface reconstruction from RGB-D sequences.

Surface Reconstruction

158

Paper
Code

Weighted Concordance Index Loss-based Multimodal Survival Modeling for Radiation Encephalopathy Assessment in Nasopharyngeal Carcinoma Radiotherapy

no code implementations • 23 Jun 2022 • Jiansheng Fang, Anwei Li, Pu-Yun OuYang, Jiajian Li, Jingwen Wang, Hongbo Liu, Fang-Yun Xie, Jiang Liu

We design a deep multimodal survival network (MSN) with two feature extractors to learn discriminative features from multimodal data.

feature selection Survival Analysis

Paper
Add Code

Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT Scans

1 code implementation • 7 Jun 2022 • Jiansheng Fang, Jingwen Wang, Anwei Li, Yuguang Yan, Yonghe Hou, Chao Song, Hongbo Liu, Jiang Liu

In the management of lung nodules, we are desirable to predict nodule evolution in terms of its diameter variation on Computed Tomography (CT) scans and then provide a follow-up recommendation according to the predicted result of the growing trend of the nodule.

Computed Tomography (CT) Management

Paper
Code

Controllable Video Captioning with an Exemplar Sentence

1 code implementation • 2 Dec 2021 • Yitian Yuan, Lin Ma, Jingwen Wang, Wenwu Zhu

In this paper, we investigate a novel and challenging task, namely controllable video captioning with an exemplar sentence.

Caption Generation Sentence +2

Paper
Code

DSP-SLAM: Object Oriented SLAM with Deep Shape Priors

1 code implementation • 21 Aug 2021 • Jingwen Wang, Martin Rünz, Lourdes Agapito

We propose DSP-SLAM, an object-oriented SLAM system that builds a rich and accurate joint map of dense 3D models for foreground objects, and sparse landmark points to represent the background.

3D Object Reconstruction Object +2

494

Paper
Code

Integrating Semantics and Neighborhood Information with Graph-Driven Generative Models for Document Retrieval

3 code implementations • ACL 2021 • Zijing Ou, Qinliang Su, Jianxing Yu, Bang Liu, Jingwen Wang, Ruihui Zhao, Changyou Chen, Yefeng Zheng

With the need of fast retrieval speed and small memory footprint, document hashing has been playing a crucial role in large-scale information retrieval.

Information Retrieval Retrieval

Paper
Code

Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report

1 code implementation • 1 Aug 2020 • Jing Shi, Zhiheng Li, Haitian Zheng, Yihang Xu, Tianyou Xiao, Weitao Tan, Xiaoning Guo, Sizhe Li, Bin Yang, Zhexin Xu, Ruitao Lin, Zhongkai Shangguan, Yue Zhao, Jingwen Wang, Rohan Sharma, Surya Iyer, Ajinkya Deshmukh, Raunak Mahalik, Srishti Singh, Jayant G Rohra, Yi-Peng Zhang, Tongyu Yang, Xuan Wen, Ethan Fahnestock, Bryce Ikeda, Ian Lawson, Alan Finkelstein, Kehao Guo, Richard Magnotti, Andrew Sexton, Jeet Ketan Thaker, Yiyang Su, Chenliang Xu

This technical report summarizes submissions and compiles from Actor-Action video classification challenge held as a final project in CSC 249/449 Machine Vision course (Spring 2020) at University of Rochester

General Classification Video Classification

Paper
Code

Recurrent Exposure Generation for Low-Light Face Detection

1 code implementation • 21 Jul 2020 • Jinxiu Liang, Jingwen Wang, Yuhui Quan, Tianyi Chen, Jiaying Liu, Haibin Ling, Yong Xu

REG produces progressively and efficiently intermediate images corresponding to various exposure settings, and such pseudo-exposures are then fused by MED to detect faces across different lighting conditions.

Face Detection Image Enhancement

Paper
Code

Deep Bilateral Retinex for Low-Light Image Enhancement

no code implementations • 4 Jul 2020 • Jinxiu Liang, Yong Xu, Yuhui Quan, Jingwen Wang, Haibin Ling, Hui Ji

Low-light images, i. e. the images captured in low-light conditions, suffer from very poor visibility caused by low contrast, color distortion and significant measurement noise.

Low-Light Image Enhancement

Paper
Add Code

STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition

no code implementations • 18 Mar 2020 • Xu Li, Jingwen Wang, Lin Ma, Kaihao Zhang, Fengzong Lian, Zhanhui Kang, Jinjun Wang

Such a design enables efficient spatio-temporal modeling and maintains a small model scale.

Action Recognition

Paper
Add Code

Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos

no code implementations • 16 Mar 2020 • Yijun Song, Jingwen Wang, Lin Ma, Zhou Yu, Jun Yu

The task of temporally grounding textual queries in videos is to localize one video segment that semantically corresponds to the given query.

Sentence

Paper
Add Code

Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis

1 code implementation • 18 Dec 2019 • Richard J. Chen, Ming Y. Lu, Jingwen Wang, Drew F. K. Williamson, Scott J. Rodig, Neal I. Lindeman, Faisal Mahmood

Cancer diagnosis, prognosis, and therapeutic response predictions are based on morphological information from histology slides and molecular profiles from genomic data.

Feature Importance

251

Paper
Code

Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos

1 code implementation • NeurIPS 2019 • Yitian Yuan, Lin Ma, Jingwen Wang, Wei Liu, Wenwu Zhu

Temporal sentence grounding in videos aims to detect and localize one target video segment, which semantically corresponds to a given sentence.

Sentence Temporal Sentence Grounding

Paper
Code

Weakly Supervised Prostate TMA Classification via Graph Convolutional Networks

no code implementations • 29 Oct 2019 • Jingwen Wang, Richard J. Chen, Ming Y. Lu, Alexander Baras, Faisal Mahmood

In prostate cancer, the Gleason score is a grading system used to measure the aggressiveness of prostate cancer from the spatial organization of cells and the distribution of glands.

Classification General Classification

Paper
Add Code

Semi-Supervised Histology Classification using Deep Multiple Instance Learning and Contrastive Predictive Coding

no code implementations • 23 Oct 2019 • Ming Y. Lu, Richard J. Chen, Jingwen Wang, Debora Dillon, Faisal Mahmood

Convolutional neural networks can be trained to perform histology slide classification using weak annotations with multiple instance learning (MIL).

Binary Classification Classification +4

Paper
Add Code

Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction

1 code implementation • 11 Sep 2019 • Jingwen Wang, Lin Ma, Wenhao Jiang

The task of temporally grounding language queries in videos is to temporally localize the best matched video segment corresponding to a given language (sentence).

Sentence

Paper
Code

Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network

1 code implementation • ICCV 2019 • Bairui Wang, Lin Ma, Wei zhang, Wenhao Jiang, Jingwen Wang, Wei Liu

In this paper, we propose to guide the video caption generation with Part-of-Speech (POS) information, based on a gated fusion of multiple representations of input videos.

Caption Generation POS +2

Paper
Code

Generating an Overview Report over Many Documents

no code implementations • 17 Aug 2019 • Jingwen Wang, Hao Zhang, Cheng Zhang, Wenjing Yang, Liqun Shao, Jie Wang

To overcome this obstacle, we present NDORGS (Numerous Documents' Overview Report Generation Scheme) that integrates text filtering, keyword scoring, single-document summarization (SDS), topic modeling, MDS, and title generation to generate a coherent, well-structured ORPT.

Attribute Decision Making +2

Paper
Add Code

An anomaly prediction framework for financial IT systems using hybrid machine learning methods

no code implementations • 30 Jul 2019 • Jingwen Wang, Jingxin Liu, Juntao Pu, Qinghong Yang, Zhongchen Miao, Jian Gao, You Song

To improve the efficiency and accuracy of system failure detection and thereby reduce the impact of system failures on financial services, we propose a novel machine learning-based framework to predict the occurrence of system exceptions and failures in a financial software system.

BIG-bench Machine Learning Time Series Prediction