Search Results for author: Yan Wang

Found 323 papers, 125 papers with code

Enabling Deep Residual Networks for Weakly Supervised Object Detection

no code implementations • ECCV 2020 • Yunhang Shen, Rongrong Ji, Yan Wang, Zhiwei Chen, Feng Zheng, Feiyue Huang, Yunsheng Wu

Weakly supervised object detection (WSOD) has attracted extensive research attention due to its great flexibility of exploiting large-scale image-level annotation for detector training.

Object object-detection +1

Paper
Add Code

Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase Generation

no code implementations • Findings (ACL) 2021 • Yixuan Su, David Vandyke, Simon Baker, Yan Wang, Nigel Collier

Paraphrase Generation

Paper
Add Code

zydhjh4593@SMM4H’22: A Generic Pre-trained BERT-based Framework for Social Media Health Text Classification

no code implementations • SMM4H (COLING) 2022 • Chenghao Huang, Xiaolu Chen, Yuxi Chen, Yutong Wu, Weimin Yuan, Yan Wang, Yanru Zhang

This paper describes our proposed framework for the 10 text classification tasks of Task 1a, 2a, 2b, 3a, 4, 5, 6, 7, 8, and 9, in the Social Media Mining for Health (SMM4H) 2022.

text-classification Text Classification

Paper
Add Code

AccidentBlip2: Accident Detection With Multi-View MotionBlip2

1 code implementation • 18 Apr 2024 • Yihua Shao, Hongyi Cai, Xinwei Long, Weiyi Lang, Zhe Wang, Haoran Wu, Yan Wang, Jiayi Yin, Yang Yang, Zhen Lei

We also extend our approach to a multi-vehicle cooperative system by deploying Motion Qformer on each vehicle and simultaneously inputting the inference-generated query into the MLP for autoregressive inference.

Language Modelling Large Language Model +2

Paper
Code

Causal Deconfounding via Confounder Disentanglement for Dual-Target Cross-Domain Recommendation

no code implementations • 17 Apr 2024 • JiaJie Zhu, Yan Wang, Feng Zhu, Zhu Sun

As a result, dual-target CDR has to meet two challenges: (1) how to effectively decouple observed confounders, including single-domain confounders and cross-domain confounders, and (2) how to preserve the positive effects of observed confounders on predicted interactions, while eliminating their negative effects on capturing comprehensive user preferences.

Disentanglement

Paper
Add Code

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

1 code implementation • 16 Apr 2024 • Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi

In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.

Image Super-Resolution

Paper
Code

RoT: Enhancing Large Language Models with Reflection on Search Trees

1 code implementation • 8 Apr 2024 • Wenyang Hui, Chengyue Jiang, Yan Wang, Kewei Tu

It uses a strong LLM to summarize guidelines from previous tree search experiences to enhance the ability of a weak LLM.

Paper
Code

Task-Aware Encoder Control for Deep Video Compression

no code implementations • 7 Apr 2024 • Xingtong Ge, Jixiang Luo, Xinjie Zhang, Tongda Xu, Guo Lu, Dailan He, Jing Geng, Yan Wang, Jun Zhang, Hongwei Qin

Prior research on deep video compression (DVC) for machine tasks typically necessitates training a unique codec for each specific task, mandating a dedicated decoder per task.

Video Compression

Paper
Add Code

Two-Phase Multi-Dose-Level PET Image Reconstruction with Dose Level Awareness

no code implementations • 2 Apr 2024 • Yuchen Fei, Yanmei Luo, Yan Wang, Jiaqi Cui, Yuanyuan Xu, Jiliu Zhou, Dinggang Shen

In this paper, to reconstruct high-quality SPET images from multi-dose-level LPET images, we design a novel two-phase multi-dose-level PET reconstruction algorithm with dose level awareness, containing a pre-training phase and a SPET prediction phase.

Image Reconstruction

Paper
Add Code

Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding

1 code implementation • 27 Mar 2024 • Zhiheng Cheng, Qingyue Wei, Hongru Zhu, Yan Wang, Liangqiong Qu, Wei Shao, Yuyin Zhou

This paper introduces H-SAM: a prompt-free adaptation of SAM tailored for efficient fine-tuning of medical images via a two-stage hierarchical decoding procedure.

Image Segmentation Medical Image Segmentation +3

Paper
Code

EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting

no code implementations • 22 Mar 2024 • Kailing Wang, Chen Yang, Yuehao Wang, Sikuang Li, Yan Wang, Qi Dou, Xiaokang Yang, Wei Shen

Precise camera tracking, high-fidelity 3D tissue reconstruction, and real-time online visualization are critical for intrabody medical imaging devices such as endoscopes and capsule robots.

Simultaneous Localization and Mapping

Paper
Add Code

Protein Conformation Generation via Force-Guided SE(3) Diffusion Models

no code implementations • 21 Mar 2024 • Yan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu

The conformational landscape of proteins is crucial to understanding their functionality in complex biological processes.

Paper
Add Code

Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization

no code implementations • 19 Mar 2024 • Jixiang Luo, Yan Wang, Hongwei Qin

MSE-based models aim to improve objective metrics while generative models are leveraged to improve visual quality measured by subjective metrics.

Image Compression Quantization

Paper
Add Code

Noise Dimension of GAN: An Image Compression Perspective

no code implementations • 14 Mar 2024 • Ziran Zhu, Tongda Xu, Ling Li, Yan Wang

This trade-off depicts the best divergence we can achieve when noise is limited.

Image Compression Image Generation

Paper
Add Code

Content-aware Masked Image Modeling Transformer for Stereo Image Compression

no code implementations • 13 Mar 2024 • Xinjie Zhang, Shenyuan Gao, Zhening Liu, Jiawei Shao, Xingtong Ge, Dailan He, Tongda Xu, Yan Wang, Jun Zhang

Existing learning-based stereo image codec adopt sophisticated transformation with simple entropy models derived from single image codecs to encode latent representations.

Image Compression

Paper
Add Code

GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

1 code implementation • 13 Mar 2024 • Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhang

In response, we propose a groundbreaking paradigm of image representation and compression by 2D Gaussian Splatting, named GaussianImage.

Quantization

Paper
Code

Few-shot Learning on Heterogeneous Graphs: Challenges, Progress, and Prospects

no code implementations • 10 Mar 2024 • Pengfei Ding, Yan Wang, Guanfeng Liu

In this paper, we provide a comprehensive review of existing FLHG methods, covering challenges, research progress, and future prospects.

Few-Shot Learning

Paper
Add Code

Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

no code implementations • 9 Mar 2024 • Junxiong Lin, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haorang Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, Jiawen Yu, Shaoqi Yan, Wenqiang Zhang

Harnessing the potential of leveraging this a priori knowledge in the context of image super-resolution presents a compelling avenue.

Image Generation Image Super-Resolution

Paper
Add Code

A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP

no code implementations • 7 Mar 2024 • Zeng Tao, Yan Wang, Junxiong Lin, Haoran Wang, Xinji Mai, Jiawen Yu, Xuan Tong, Ziheng Zhou, Shaoqi Yan, Qing Zhao, Liyuan Han, Wenqiang Zhang

Specifically, our A$^{3}$lign-DFER method is designed with multiple modules that work together to obtain the most suitable expanded-dimensional embeddings for classification and to achieve alignment in three key aspects: affective, dynamic, and bidirectional.

Dynamic Facial Expression Recognition Facial Expression Recognition

Paper
Add Code

Dcl-Net: Dual Contrastive Learning Network for Semi-Supervised Multi-Organ Segmentation

no code implementations • 6 Mar 2024 • Lu Wen, Zhenghao Feng, Yun Hou, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wang

Semi-supervised learning is a sound measure to relieve the strict demand of abundant annotated datasets, especially for challenging multi-organ segmentation .

Contrastive Learning Organ Segmentation

Paper
Add Code

Dose Prediction Driven Radiotherapy Paramters Regression via Intra- and Inter-Relation Modeling

no code implementations • 29 Feb 2024 • Jiaqi Cui, Yuanyuan Xu, Jianghong Xiao, Yuchen Fei, Jiliu Zhou, Xingcheng Peng, Yan Wang

Deep learning has facilitated the automation of radiotherapy by predicting accurate dose distribution maps.

regression Relation

Paper
Add Code

CAMixerSR: Only Details Need More "Attention"

1 code implementation • 29 Feb 2024 • Yan Wang, Yi Liu, Shijie Zhao, Junlin Li, Li Zhang

To satisfy the rapidly increasing demands on the large image (2K-8K) super-resolution (SR), prevailing methods follow two independent tracks: 1) accelerate existing networks by content-aware routing, and 2) design better super-resolution networks via token mixer refining.

2k 8k +1

120

Paper
Code

Boosting Neural Representations for Videos with a Conditional Decoder

1 code implementation • 28 Feb 2024 • Xinjie Zhang, Ren Yang, Dailan He, Xingtong Ge, Tongda Xu, Yan Wang, Hongwei Qin, Jun Zhang

Implicit neural representations (INRs) have emerged as a promising approach for video storage and processing, showing remarkable versatility across various video tasks.

Paper
Code

EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection

1 code implementation • 23 Feb 2024 • Zhe Wang, Siqi Fan, Xiaoliang Huo, Tongda Xu, Yan Wang, Jingjing Liu, Yilun Chen, Ya-Qin Zhang

In autonomous driving, cooperative perception makes use of multi-view cameras from both vehicles and infrastructure, providing a global vantage point with rich semantic context of road conditions beyond a single vehicle viewpoint.

3D Object Detection Autonomous Driving +2

Paper
Code

A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

1 code implementation • 20 Feb 2024 • Nailei Hei, Qianyu Guo, ZiHao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang

To bridge the distribution gap between user input behavior and model training datasets, we first construct a novel Coarse-Fine Granularity Prompts dataset (CFP) and propose a novel User-Friendly Fine-Grained Text Generation framework (UF-FGTG) for automated prompt optimization.

Image Generation Prompt Engineering +1

Paper
Code

Consistency Models Improve Diffusion Inverse Solvers

no code implementations • 9 Feb 2024 • Tongda Xu, Ziran Zhu, Dailan He, Yuanyuan Wang, Ming Sun, Ning li, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

In addition, we propose a new family of DIS using pure CM.

Image Captioning Semantic Segmentation

Paper
Add Code

Adaptive Hypergraph Network for Trust Prediction

1 code implementation • 7 Feb 2024 • Rongwei Xu, Guanfeng Liu, Yan Wang, Xuyun Zhang, Kai Zheng, Xiaofang Zhou

In this paper, we propose an Adaptive Hypergraph Network for Trust Prediction (AHNTP), a novel approach that improves trust prediction accuracy by using higher-order correlations.

Contrastive Learning Decision Making

Paper
Code

A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction

1 code implementation • 7 Feb 2024 • Hailiang Li, Yan Huo, Yan Wang, Xu Yang, Miaohui Hao, Xiao Wang

As the modern CPU, GPU, and NPU chip design complexity and transistor counts keep increasing, and with the relentless shrinking of semiconductor technology nodes to nearly 1 nanometer, the placement and routing have gradually become the two most pivotal processes in modern very-large-scale-integrated (VLSI) circuit back-end design.

Avg SSIM

Paper
Code

Triplet-constraint Transformer with Multi-scale Refinement for Dose Prediction in Radiotherapy

no code implementations • 7 Feb 2024 • Lu Wen, Qihun Zhang, Zhenghao Feng, Yuanyuan Xu, Xiao Chen, Jiliu Zhou, Yan Wang

Radiotherapy is a primary treatment for cancers with the aim of applying sufficient radiation dose to the planning target volume (PTV) while minimizing dose hazards to the organs at risk (OARs).

Paper
Add Code

Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies

no code implementations • 6 Feb 2024 • Zhixuan Chu, Yan Wang, Feng Zhu, Lu Yu, Longfei Li, Jinjie Gu

The advent of large language models (LLMs) such as ChatGPT, PaLM, and GPT-4 has catalyzed remarkable advances in natural language processing, demonstrating human-like language fluency and reasoning capacities.

Position

Paper
Add Code

DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models

no code implementations • 5 Feb 2024 • Yang Sui, Huy Phan, Jinqi Xiao, Tianfang Zhang, Zijie Tang, Cong Shi, Yan Wang, Yingying Chen, Bo Yuan

In this paper, for the first time, we systematically explore the detectability of the poisoned noise input for the backdoored diffusion models, an important performance metric yet little explored in the existing works.

Backdoor Attack

Paper
Add Code

Image2Points:A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction

1 code implementation • 1 Feb 2024 • Jiaqi Cui, Yan Wang, Lu Wen, Pinxian Zeng, Xi Wu, Jiliu Zhou, Dinggang Shen

To obtain high-quality Positron emission tomography (PET) images while minimizing radiation exposure, numerous methods have been proposed to reconstruct standard-dose PET (SPET) images from the corresponding low-dose PET (LPET) images.

Image Reconstruction

Paper
Code

Deep Learning with Information Fusion and Model Interpretation for Health Monitoring of Fetus based on Long-term Prenatal Electronic Fetal Heart Rate Monitoring Data

1 code implementation • 27 Jan 2024 • Zenghui Lin, Xintong Liu, Nan Wang, Ruichen Li, Qingao Liu, Jingying Ma, LiWei Wang, Yan Wang, Shenda Hong

This kind of continuous monitoring, in contrast to the short-term one, collects an extended period of fetal heart data.

Specificity

Paper
Code

GMC-IQA: Exploiting Global-correlation and Mean-opinion Consistency for No-reference Image Quality Assessment

no code implementations • 19 Jan 2024 • Zewen Chen, Juan Wang, Bing Li, Chunfeng Yuan, Weiming Hu, Junxian Liu, Peng Li, Yan Wang, Youqun Zhang, Congxuan Zhang

Due to the subjective nature of image quality assessment (IQA), assessing which image has better quality among a sequence of images is more reliable than assigning an absolute mean opinion score for an image.

No-Reference Image Quality Assessment

Paper
Add Code

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

no code implementations • 17 Jan 2024 • Baoxiong Jia, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang

In comparison to recent advancements in the 2D domain, grounding language in 3D scenes faces several significant challenges: (i) the inherent complexity of 3D scenes due to the diverse object configurations, their rich attributes, and intricate relationships; (ii) the scarcity of paired 3D vision-language data to support grounded learning; and (iii) the absence of a unified learning framework to distill knowledge from grounded 3D data.

Scene Understanding Visual Grounding

Paper
Add Code

Idempotence and Perceptual Image Compression

1 code implementation • 17 Jan 2024 • Tongda Xu, Ziran Zhu, Dailan He, Yanghao Li, Lina Guo, Yuanyuan Wang, Zhe Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

However, we find that theoretically: 1) Conditional generative model-based perceptual codec satisfies idempotence; 2) Unconditional generative model with idempotence constraint is equivalent to conditional generative codec.

Image Compression

Paper
Code

LLM-Guided Multi-View Hypergraph Learning for Human-Centric Explainable Recommendation

no code implementations • 16 Jan 2024 • Zhixuan Chu, Yan Wang, Qing Cui, Longfei Li, Wenqing Chen, Zhan Qin, Kui Ren

As personalized recommendation systems become vital in the age of information overload, traditional methods relying solely on historical user interactions often fail to fully capture the multifaceted nature of human interests.

Explainable Recommendation Recommendation Systems

Paper
Add Code

A Deep Learning Representation of Spatial Interaction Model for Resilient Spatial Planning of Community Business Clusters

no code implementations • 9 Jan 2024 • Haiyan Hao, Yan Wang

To address the limitation, we propose a SIM-GAT model to predict spatiotemporal visitation flows between community business clusters and their trade areas.

Graph Attention

Paper
Add Code

TeleChat Technical Report

no code implementations • 8 Jan 2024 • Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei he, Zhuoru Jiang, Qiyi Xie, Yanhan Zhang, Zhongqiu Li, Lingling Shi, Weiwei Fu, Yin Zhang, Zilu Huang, Sishi Xiong, Yuxiang Zhang, Chao Wang, Shuangyong Song

Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe.

Code Generation Question Answering

Paper
Add Code

Few-Shot Causal Representation Learning for Out-of-Distribution Generalization on Heterogeneous Graphs

no code implementations • 7 Jan 2024 • Pengfei Ding, Yan Wang, Guanfeng Liu, Nan Wang, Xiaofang Zhou

To address this challenging problem, we propose a novel Causal OOD Heterogeneous graph Few-shot learning model, namely COHF.

Few-Shot Learning Out-of-Distribution Generalization +2

Paper
Add Code

Energy based diffusion generator for efficient sampling of Boltzmann distributions

no code implementations • 4 Jan 2024 • Yan Wang, Ling Guo, Hao Wu, Tao Zhou

We introduce a novel sampler called the energy based diffusion generator for generating samples from arbitrary target distributions.

Paper
Add Code

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

no code implementations • 25 Dec 2023 • Yifan Lu, Ziqi Zhang, Chunfeng Yuan, Peng Li, Yan Wang, Bing Li, Weiming Hu

Each caption in the set is attached to a concept combination indicating the primary semantic content of the caption and facilitating element alignment in set prediction.

Caption Generation Video Captioning

Paper
Add Code

DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution

1 code implementation • 22 Dec 2023 • Yan Wang, Tongtong Su, Yusen Li, Jiuwen Cao, Gang Wang, Xiaoguang Liu

Specifically, we propose a plug-in reparameterized dynamic unit (RDU) to promote the performance and inference cost trade-off.

Image Super-Resolution

Paper
Code

Object Attribute Matters in Visual Question Answering

no code implementations • 20 Dec 2023 • Peize Li, Qingyi Si, Peng Fu, Zheng Lin, Yan Wang

In this paper, we propose a novel VQA approach from the perspective of utilizing object attribute, aiming to achieve better object-level visual-language alignment and multimodal scene understanding.

Attribute Knowledge Distillation +5

Paper
Add Code

Appeal: Allow Mislabeled Samples the Chance to be Rectified in Partial Label Learning

no code implementations • 18 Dec 2023 • Chongjie Si, Xuehui Wang, Yan Wang, Xiaokang Yang, Wei Shen

In partial label learning (PLL), each instance is associated with a set of candidate labels among which only one is ground-truth.

Partial Label Learning

Paper
Add Code

CogAgent: A Visual Language Model for GUI Agents

1 code implementation • 14 Dec 2023 • Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

People are spending an enormous amount of time on digital devices through graphical user interfaces (GUIs), e. g., computer or smartphone screens.

Ranked #14 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

4,975

Paper
Code

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

no code implementations • 9 Dec 2023 • Shitian Zhao, Zhuowan Li, Yadong Lu, Alan Yuille, Yan Wang

We propose Causal Context Generation, Causal-CoG, which is a prompting strategy that engages contextual information to enhance precise VQA during inference.

Question Answering Visual Question Answering

Paper
Add Code

Large Language Models for Intent-Driven Session Recommendations

1 code implementation • 7 Dec 2023 • Zhu Sun, Hongyang Liu, Xinghua Qu, Kaidong Feng, Yan Wang, Yew-Soon Ong

Intent-aware session recommendation (ISR) is pivotal in discerning user intents within sessions for precise predictions.

Paper
Code

Unified learning-based lossy and lossless JPEG recompression

no code implementations • 5 Dec 2023 • Jianghui Zhang, Yuanyuan Wang, Lina Guo, Jixiang Luo, Tongda Xu, Yan Wang, Zhi Wang, Hongwei Qin

Most image compression algorithms only consider uncompressed original image, while ignoring a large number of already existing JPEG images.

Image Compression Quantization

Paper
Add Code

An Embodied Generalist Agent in 3D World

1 code implementation • 18 Nov 2023 • Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

Leveraging massive knowledge and learning schemes from large language models (LLMs), recent machine learning models show notable successes in building generalist agents that exhibit the capability of general-purpose task solving in diverse domains, including natural language processing, computer vision, and robotics.

3D dense captioning Question Answering +3

196

Paper
Code

ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks

1 code implementation • 16 Nov 2023 • Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Liang Chen, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein

While Large Language Models (LLMs) have demonstrated proficiency in code generation benchmarks, translating these results into practical development scenarios - where leveraging existing repository-level libraries is the norm - remains challenging.

Code Generation Navigate

Paper
Code

Explainable History Distillation by Marked Temporal Point Process

no code implementations • 13 Nov 2023 • Sishun Liu, Ke Deng, Yan Wang, Xiuzhen Zhang

To efficiently solve \acrshort{ehd}, we rewrite the task into a \gls{01ip} and directly estimate the solution to the program by a model called \acrfull{model}.

counterfactual

Paper
Add Code

PepLand: a large-scale pre-trained peptide representation model for a comprehensive landscape of both canonical and non-canonical amino acids

1 code implementation • 8 Nov 2023 • Ruochi Zhang, Haoran Wu, Yuting Xiu, Kewei Li, Ningning Chen, Yu Wang, Yan Wang, Xin Gao, Fengfeng Zhou

In recent years, the scientific community has become increasingly interested on peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation.

Paper
Code

High-performance Power Allocation Strategies for Active IRS-aided Wireless Network

no code implementations • 7 Nov 2023 • Yifan Zhao, Xuehui Wang, Yan Wang, Xianpeng Wang, Zhilin Chen, Feng Shu, Cunhua Pan, Jiangzhou Wang

Due to its slow linear convergence from iterative GA, the proposed ESMPI-GA is high-complexity.

Paper
Add Code

Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

1 code implementation • 7 Nov 2023 • Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

We propose a novel framework to integrate SD maps into online map prediction and propose a Transformer-based encoder, SD Map Encoder Representations from transFormers, to leverage priors in SD maps for the lane-topology prediction task.

Autonomous Driving Lane Detection

Paper
Code

CogVLM: Visual Expert for Pretrained Language Models

1 code implementation • 6 Nov 2023 • Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang

We introduce CogVLM, a powerful open-source visual language foundation model.

Ranked #4 on Visual Question Answering (VQA) on InfiMM-Eval

Language Modelling Visual Question Answering

4,975

Paper
Code

Diffusion-based Radiotherapy Dose Prediction Guided by Inter-slice Aware Structure Encoding

no code implementations • 6 Nov 2023 • Zhenghao Feng, Lu Wen, Jianghong Xiao, Yuanyuan Xu, Xi Wu, Jiliu Zhou, Xingchen Peng, Yan Wang

In the forward process, DiffDose transforms dose distribution maps into pure Gaussian noise by gradually adding small noise and a noise predictor is simultaneously trained to estimate the noise added at each timestep.

Paper
Add Code

Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context

no code implementations • 14 Oct 2023 • Yicheng Song, Shuyong Gao, Haozhe Xing, Yiting Cheng, Yan Wang, Wenqiang Zhang

Unsupervised salient object detection aims to detect salient objects without using supervision signals eliminating the tedious task of manually labeling salient objects.

Contrastive Learning object-detection +3

Paper
Add Code

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers

2 code implementations • 11 Oct 2023 • Jieneng Chen, Jieru Mei, Xianhang Li, Yongyi Lu, Qihang Yu, Qingyue Wei, Xiangde Luo, Yutong Xie, Ehsan Adeli, Yan Wang, Matthew Lungren, Lei Xing, Le Lu, Alan Yuille, Yuyin Zhou

In this paper, we extend the 2D TransUNet architecture to a 3D network by building upon the state-of-the-art nnU-Net architecture, and fully exploring Transformers' potential in both the encoder and decoder design.

Image Segmentation Medical Image Segmentation +3

2,132

Paper
Code

Enhancing Asynchronous Time Series Forecasting with Contrastive Relational Inference

no code implementations • 6 Sep 2023 • Yan Wang, Zhixuan Chu, Tao Zhou, Caigao Jiang, Hongyan Hao, Minjie Zhu, Xindong Cai, Qing Cui, Longfei Li, james Y zhang, Siqiao Xue, Jun Zhou

Asynchronous time series, also known as temporal event sequences, are the basis of many applications throughout different industries.

Point Processes Time Series +2

Paper
Add Code

Gene-induced Multimodal Pre-training for Image-omic Classification

no code implementations • 6 Sep 2023 • Ting Jin, Xingran Xie, Renjie Wan, Qingli Li, Yan Wang

Histology analysis of the tumor micro-environment integrated with genomic assays is the gold standard for most cancers in modern medicine.

Classification whole slide images

Paper
Add Code

Bandwidth-efficient Inference for Neural Image Compression

no code implementations • 6 Sep 2023 • Shanzhi Yin, Tongda Xu, Yongsheng Liang, Yuanyuan Wang, Yanghao Li, Yan Wang, Jingjing Liu

With neural networks growing deeper and feature maps growing larger, limited communication bandwidth with external memory (or DRAM) and power constraints become a bottleneck in implementing network inference on mobile and edge devices.

Data Compression Image Compression +1

Paper
Add Code

Enhancing Psychological Counseling with Large Language Model: A Multifaceted Decision-Support System for Non-Professionals

1 code implementation • 29 Aug 2023 • Guanghui Fu, Qing Zhao, Jianqiang Li, Dan Luo, Changwei Song, Wei Zhai, Shuo Liu, Fan Wang, Yan Wang, Lijuan Cheng, Juan Zhang, Bing Xiang Yang

In the contemporary landscape of social media, an alarming number of users express negative emotions, some of which manifest as strong suicidal intentions.

Language Modelling Large Language Model

Paper
Code

Mind vs. Mouth: On Measuring Re-judge Inconsistency of Social Bias in Large Language Models

no code implementations • 24 Aug 2023 • Yachao Zhao, Bo wang, Dongming Zhao, Kun Huang, Yan Wang, Ruifang He, Yuexian Hou

We propose that this re-judge inconsistency can be similar to the inconsistency between human's unaware implicit social bias and their aware explicit social bias.

Paper
Add Code

A Unified Framework for 3D Point Cloud Visual Grounding

1 code implementation • 23 Aug 2023 • Haojia Lin, Yongdong Luo, Xiawu Zheng, Lijiang Li, Fei Chao, Taisong Jin, Donghao Luo, Yan Wang, Liujuan Cao, Rongrong Ji

This elaborate design enables 3DRefTR to achieve both well-performing 3DRES and 3DREC capacities with only a 6% additional latency compared to the original 3DREC model.

Referring Expression Referring Expression Comprehension +1

Paper
Code

Enhancing Recommender Systems with Large Language Model Reasoning Graphs

no code implementations • 21 Aug 2023 • Yan Wang, Zhixuan Chu, Xin Ouyang, Simeng Wang, Hongyan Hao, Yue Shen, Jinjie Gu, Siqiao Xue, james Y zhang, Qing Cui, Longfei Li, Jun Zhou, Sheng Li

In this paper, we propose a novel approach that leverages large language models (LLMs) to construct personalized reasoning graphs.

Language Modelling Large Language Model +1

Paper
Add Code

Leveraging Large Language Models for Pre-trained Recommender Systems

no code implementations • 21 Aug 2023 • Zhixuan Chu, Hongyan Hao, Xin Ouyang, Simeng Wang, Yan Wang, Yue Shen, Jinjie Gu, Qing Cui, Longfei Li, Siqiao Xue, james Y zhang, Sheng Li

In this paper, we propose RecSysLLM, a novel pre-trained recommendation model based on LLMs.

Recommendation Systems

Paper
Add Code

Polymerized Feature-based Domain Adaptation for Cervical Cancer Dose Map Prediction

no code implementations • 20 Aug 2023 • Jie Zeng, Zeyu Han, Xingchen Peng, Jianghong Xiao, Peng Wang, Yan Wang

Recently, deep learning (DL) has automated and accelerated the clinical radiation therapy (RT) planning significantly by predicting accurate dose maps.

Domain Adaptation

Paper
Add Code

Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction

1 code implementation • 20 Aug 2023 • Zeyu Han, YuHan Wang, Luping Zhou, Peng Wang, Binyu Yan, Jiliu Zhou, Yan Wang, Dinggang Shen

To obtain high-quality positron emission tomography (PET) scans while reducing radiation exposure to the human body, various approaches have been proposed to reconstruct standard-dose PET (SPET) images from low-dose PET (LPET) images.

Paper
Code

Conditional Perceptual Quality Preserving Image Compression

no code implementations • 16 Aug 2023 • Tongda Xu, Qian Zhang, Yanghao Li, Dailan He, Zhe Wang, Yuanyuan Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

We propose conditional perceptual quality, an extension of the perceptual quality defined in \citet{blau2018perception}, by conditioning it on user defined information.

Image Compression

Paper
Add Code

Cross-heterogeneity Graph Few-shot Learning

no code implementations • 10 Aug 2023 • Pengfei Ding, Yan Wang, Guanfeng Liu

In recent years, heterogeneous graph few-shot learning has been proposed to address the label sparsity issue in heterogeneous graphs (HGs), which contain various types of nodes and edges.

Few-Shot Learning Informativeness

Paper
Add Code

TriDo-Former: A Triple-Domain Transformer for Direct PET Reconstruction from Low-Dose Sinograms

no code implementations • 10 Aug 2023 • Jiaqi Cui, Pinxian Zeng, Xinyi Zeng, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wang, Dinggang Shen

Specifically, the TriDo-Former consists of two cascaded networks, i. e., a sinogram enhancement transformer (SE-Former) for denoising the input LPET sinograms and a spatial-spectral reconstruction transformer (SSR-Former) for reconstructing SPET images from the denoised sinograms.

Denoising Image Reconstruction +1

Paper
Add Code

Kairos: Practical Intrusion Detection and Investigation using Whole-system Provenance

1 code implementation • 9 Aug 2023 • Zijun Cheng, Qiujian Lv, Jinyuan Liang, Yan Wang, Degang Sun, Thomas Pasquier, Xueyuan Han

Sifting through their design documents, we identify four common dimensions that drive the development of provenance-based intrusion detection systems (PIDSes): scope (can PIDSes detect modern attacks that infiltrate across application boundaries?

Intrusion Detection

Paper
Code

Color Image Recovery Using Generalized Matrix Completion over Higher-Order Finite Dimensional Algebra

no code implementations • 4 Aug 2023 • Liang Liao, Zhuang Guo, Qi Gao, Yan Wang, Fajun Yu, Qifeng Zhao, Stephen Johh Maybank

To improve the accuracy of color image completion with missing entries, we present a recovery method based on generalized higher-order scalars.

Matrix Completion

Paper
Add Code

Continual Learning in Predictive Autoscaling

no code implementations • 29 Jul 2023 • Hongyan Hao, Zhixuan Chu, Shiyi Zhu, Gangwei Jiang, Yan Wang, Caigao Jiang, James Zhang, Wei Jiang, Siqiao Xue, Jun Zhou

In order to surmount this challenge and effectively integrate new sample distribution, we propose a density-based sample selection strategy that utilizes kernel density estimation to calculate sample density as a reference to compute sample weight, and employs weight sampling to construct a new memory set.

Continual Learning Density Estimation

Paper
Add Code

Domain Disentanglement with Interpolative Data Augmentation for Dual-Target Cross-Domain Recommendation

no code implementations • 26 Jul 2023 • JiaJie Zhu, Yan Wang, Feng Zhu, Zhu Sun

In DIDA-CDR, we first propose an interpolative data augmentation approach to generating both relevant and diverse augmented user representations to augment sparser domain and explore potential user preferences.

Data Augmentation Disentanglement

Paper
Add Code

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception

1 code implementation • ICCV 2023 • Dingkang Yang, Shuai Huang, Zhi Xu, Zhenpeng Li, Shunli Wang, Mingcheng Li, Yuzheng Wang, Yang Liu, Kun Yang, Zhaoyu Chen, Yan Wang, Jing Liu, Peixuan Zhang, Peng Zhai, Lihua Zhang

Driver distraction has become a significant cause of severe traffic accidents over the past decade.

Paper
Code

SL: Stable Learning in Source-Free Domain Adaption for Medical Image Segmentation

no code implementations • 24 Jul 2023 • Yixin Chen, Yan Wang

In this challenge defined as Source-Free UDA, the previous UDA medical methods are limited.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

DiffDP: Radiotherapy Dose Prediction via a Diffusion Model

no code implementations • 19 Jul 2023 • Zhenghao Feng, Lu Wen, Peng Wang, Binyu Yan, Xi Wu, Jiliu Zhou, Yan Wang

To alleviate this limitation, we innovatively introduce a diffusion-based dose prediction (DiffDP) model for predicting the radiotherapy dose distribution of cancer patients.

Anatomy

Paper
Add Code

EasyTPP: Towards Open Benchmarking Temporal Point Processes

1 code implementation • 16 Jul 2023 • Siqiao Xue, Xiaoming Shi, Zhixuan Chu, Yan Wang, Hongyan Hao, Fan Zhou, Caigao Jiang, Chen Pan, James Y. Zhang, Qingsong Wen, Jun Zhou, Hongyuan Mei

In this paper, we present EasyTPP, the first central repository of research assets (e. g., data, models, evaluation programs, documentations) in the area of event sequence modeling.

Benchmarking Point Processes

176

Paper
Code

Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling

no code implementations • 16 Jul 2023 • Longyue Wang, Zefeng Du, Donghuai Liu, Deng Cai, Dian Yu, Haiyun Jiang, Yan Wang, Leyang Cui, Shuming Shi, Zhaopeng Tu

Modeling discourse -- the linguistic phenomena that go beyond individual sentences, is a fundamental yet challenging aspect of natural language processing (NLP).

Language Modelling Sentence

Paper
Add Code

Copy Is All You Need

1 code implementation • 13 Jul 2023 • Tian Lan, Deng Cai, Yan Wang, Heyan Huang, Xian-Ling Mao

The dominant text generation models compose the output by sequentially selecting words from a fixed vocabulary.

Domain Adaptation Language Modelling +1

177

Paper
Code

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

no code implementations • 1 Jul 2023 • Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong Liu

Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations.

Benchmarking Data Augmentation +5

Paper
Add Code

Improving the Transferability of Time Series Forecasting with Decomposition Adaptation

no code implementations • 30 Jun 2023 • Yan Gao, Yan Wang, Qiang Wang

However, in time series forecasting, it is difficult to obtain enough data, which limits the performance of neural forecasting models.

Multivariate Time Series Forecasting Time Series +1

Paper
Add Code

A Unified Framework for Online Data-Driven Predictive Control with Robust Safety Guarantees

no code implementations • 29 Jun 2023 • Amin Vahidi-Moghaddam, Kaian Chen, Kaixiang Zhang, Zhaojian Li, Yan Wang, Kai Wu

Despite great successes, model predictive control (MPC) relies on an accurate dynamical model and requires high onboard computational power, impeding its wider adoption in engineering systems, especially for nonlinear real-time systems with limited computation power.

Model Predictive Control

Paper
Add Code

Extended Neighboring Extremal Optimal Control with State and Preview Perturbations

no code implementations • 7 Jun 2023 • Amin Vahidi-Moghaddam, Kaixiang Zhang, Zhaojian Li, Xunyuan Yin, Ziyou Song, Yan Wang

In this work, an extended NE (ENE) framework is developed to systematically adapt the nominal control to both state and preview perturbations.

Model Predictive Control

Paper
Add Code

Asymptotic Performance Analysis of Large-Scale Active IRS-Aided Wireless Network

no code implementations • 31 May 2023 • Yan Wang, Feng Shu, Zhihong Zhuang, Rongen Dong, Qi Zhang, Di wu, Liang Yang, Jiangzhou Wang

Numerical simulation results show that a 3-bit discrete phase shifter is required to achieve a trivial performance loss for a large-scale active IRS.

Quantization

Paper
Add Code

MedNgage: A Dataset for Understanding Engagement in Patient-Nurse Conversations

no code implementations • 31 May 2023 • Yan Wang, Heidi Ann Scharf Donovan, Sabit Hassan, Mailhe Alikhani

In this paper, we present a novel dataset (MedNgage), which consists of patient-nurse conversations about cancer symptom management.

Management

Paper
Add Code

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

1 code implementation • 30 May 2023 • Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi

To address the DoT problem, we propose a Multi-Agent Debate (MAD) framework, in which multiple agents express their arguments in the state of "tit for tat" and a judge manages the debate process to obtain a final solution.

Arithmetic Reasoning Machine Translation

171

Paper
Code

PandaGPT: One Model To Instruction-Follow Them All

1 code implementation • 25 May 2023 • Yixuan Su, Tian Lan, Huayang Li, Jialu Xu, Yan Wang, Deng Cai

To do so, PandaGPT combines the multimodal encoders from ImageBind and the large language models from Vicuna.

Instruction Following

726

Paper
Code

Joint Uplink and Downlink Resource Allocation Towards Energy-efficient Transmission for URLLC

no code implementations • 25 May 2023 • Kang Li, Pengcheng Zhu, Yan Wang, Fu-Chun Zheng, Xiaohu You

With the proposed packet delivery mechanism, we jointly optimize bandwidth allocation and power control of uplink and downlink, antenna configuration, and subchannel assignment to minimize the average total power under the constraint of URLLC transmission requirements.

Paper
Add Code

Privacy-preserving Adversarial Facial Features

no code implementations • CVPR 2023 • Zhibo Wang, He Wang, Shuaifan Jin, Wenwen Zhang, Jiahui Hu, Yan Wang, Peng Sun, Wei Yuan, Kaixin Liu, Kui Ren

In this paper, we propose an adversarial features-based face privacy protection (AdvFace) approach to generate privacy-preserving adversarial features, which can disrupt the mapping from adversarial features to facial images to defend against reconstruction attacks.

Face Recognition Privacy Preserving

Paper
Add Code

Uncertainty Quantification in Machine Learning for Engineering Design and Health Prognostics: A Tutorial

1 code implementation • 7 May 2023 • Venkat Nemani, Luca Biggio, Xun Huan, Zhen Hu, Olga Fink, Anh Tran, Yan Wang, Xiaoge Zhang, Chao Hu

In this tutorial, we aim to provide a holistic lens on emerging UQ methods for ML models with a particular focus on neural networks and the applications of these UQ methods in tackling engineering design as well as prognostics and health management problems.

Decision Making Management +2

Paper
Code

Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

2 code implementations • CVPR 2023 • Yunhao Bai, Duowen Chen, Qingli Li, Wei Shen, Yan Wang

In semi-supervised medical image segmentation, there exist empirical mismatch problems between labeled and unlabeled data distribution.

Ranked #2 on Semi-supervised Medical Image Segmentation on ACDC 5% labeled data

Image Segmentation Semi-supervised Medical Image Segmentation +1

1,988

Paper
Code

FVP: Fourier Visual Prompting for Source-Free Unsupervised Domain Adaptation of Medical Image Segmentation

no code implementations • 26 Apr 2023 • Yan Wang, Jian Cheng, Yixin Chen, Shuai Shao, Lanyun Zhu, Zhenzhou Wu, Tao Liu, Haogang Zhu

In FVP, the visual prompt is parameterized using only a small amount of low-frequency learnable parameters in the input frequency space, and is learned by minimizing the segmentation loss between the predicted segmentation of the prompted target image and reliable pseudo segmentation label of the target image under the frozen model.

Image Segmentation Medical Image Segmentation +4

Paper
Add Code

SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More

1 code implementation • 18 Apr 2023 • Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Zejian Li, Lingyun Sun, Papa Mao, Ying Zang

We can even outperform task-specific network models and achieve state-of-the-art performance in the task we tested: camouflaged object detection, shadow detection.

General Knowledge Image Segmentation +7

799

Paper
Code

Experience-Based Evolutionary Algorithms for Expensive Optimization

1 code implementation • 9 Apr 2023 • Xunzhao Yu, Yan Wang, Ling Zhu, Dimitar Filev, Xin Yao

Our experimental results on expensive multi-objective and constrained optimization problems demonstrate that experiences gained from related tasks are beneficial for the saving of evaluation budgets on the target problem.

Evolutionary Algorithms Meta-Learning

Paper
Code

Efficient Decision-based Black-box Patch Attacks on Video Recognition

no code implementations • ICCV 2023 • Kaixun Jiang, Zhaoyu Chen, Hao Huang, Jiafeng Wang, Dingkang Yang, Bo Li, Yan Wang, Wenqiang Zhang

First, STDE introduces target videos as patch textures and only adds patches on keyframes that are adaptively selected by temporal difference.

Video Recognition

Paper
Add Code

VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object Detection

2 code implementations • 20 Mar 2023 • Zhe Wang, Siqi Fan, Xiaoliang Huo, Tongda Xu, Yan Wang, Jingjing Liu, Yilun Chen, Ya-Qin Zhang

In autonomous driving, Vehicle-Infrastructure Cooperative 3D Object Detection (VIC3D) makes use of multi-view cameras from both vehicles and traffic infrastructure, providing a global vantage point with rich semantic context of road conditions beyond a single vehicle viewpoint.

3D Object Detection Autonomous Driving +2

Paper
Code

SpiderMesh: Spatial-aware Demand-guided Recursive Meshing for RGB-T Semantic Segmentation

1 code implementation • 15 Mar 2023 • Siqi Fan, Zhe Wang, Yan Wang, Jingjing Liu

For semantic segmentation in urban scene understanding, RGB cameras alone often fail to capture a clear holistic topology in challenging lighting conditions.

Ranked #8 on Thermal Image Segmentation on PST900

Data Augmentation Segmentation +2

Paper
Code

Calibration-free BEV Representation for Infrastructure Perception

1 code implementation • 7 Mar 2023 • Siqi Fan, Zhe Wang, Xiaoliang Huo, Yan Wang, Jingjing Liu

Effective BEV object detection on infrastructure can greatly improve traffic scenes understanding and vehicle-toinfrastructure (V2I) cooperative perception.

Ranked #5 on 3D Object Detection on DAIR-V2X-I

3D Object Detection object-detection

Paper
Code

DCMT: A Direct Entire-Space Causal Multi-Task Framework for Post-Click Conversion Estimation

no code implementations • 13 Feb 2023 • Feng Zhu, Mingjie Zhong, Xinxing Yang, Longfei Li, Lu Yu, Tiehua Zhang, Jun Zhou, Chaochao Chen, Fei Wu, Guanfeng Liu, Yan Wang

In recommendation scenarios, there are two long-standing challenges, i. e., selection bias and data sparsity, which lead to a significant drop in prediction accuracy for both Click-Through Rate (CTR) and post-click Conversion Rate (CVR) tasks.

counterfactual Multi-Task Learning +1

Paper
Add Code

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models

1 code implementation • 10 Feb 2023 • Yang Liu, Dingkang Yang, Yan Wang, Jing Liu, Jun Liu, Azzedine Boukerche, Peng Sun, Liang Song

Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance systems, enabling the temporal or spatial identification of anomalous events within videos.

Anomaly Detection Event Detection +1

Paper
Code

IB-UQ: Information bottleneck based uncertainty quantification for neural function regression and neural operator learning

no code implementations • 7 Feb 2023 • Ling Guo, Hao Wu, Wenwen Zhou, Yan Wang, Tao Zhou

We propose a novel framework for uncertainty quantification via information bottleneck (IB-UQ) for scientific machine learning tasks, including deep neural network (DNN) regression and neural operator learning (DeepONet).

Data Augmentation Operator learning +2

Paper
Add Code

Exploring Invariant Representation for Visible-Infrared Person Re-Identification

no code implementations • 2 Feb 2023 • Lei Tan, Yukang Zhang, ShengMei Shen, Yan Wang, Pingyang Dai, Xianming Lin, Yongjian Wu, Rongrong Ji

Cross-spectral person re-identification, which aims to associate identities to pedestrians across different spectra, faces a main challenge of the modality discrepancy.

Data Augmentation Person Re-Identification

Paper
Add Code

A Counterfactual Collaborative Session-based Recommender System

1 code implementation • 31 Jan 2023 • Wenzhuo Song, Shoujin Wang, Yan Wang, Kunpeng Liu, Xueyan Liu, Minghao Yin

Next, COCO-SBRS adopts counterfactual inference to recommend items based on the outputs of the pre-trained recommendation model considering the causalities to alleviate the data sparsity problem.

counterfactual Counterfactual Inference +1

Paper
Code

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle

1 code implementation • ICCV 2023 • Song Guo, Lei Zhang, Xiawu Zheng, Yan Wang, Yuchao Li, Fei Chao, Chenglin Wu, Shengchuan Zhang, Rongrong Ji

In this paper, we try to solve this problem by introducing a principled and unified framework based on Information Bottleneck (IB) theory, which further guides us to an automatic pruning approach.

Network Pruning

Paper
Code

Rethinking Safe Semi-supervised Learning: Transferring the Open-set Problem to A Close-set One

no code implementations • ICCV 2023 • Qiankun Ma, Jiyao Gao, Bo Zhan, Yunpeng Guo, Jiliu Zhou, Yan Wang

Conventional semi-supervised learning (SSL) lies in the close-set assumption that the labeled and unlabeled sets contain data with the same seen classes, called in-distribution (ID) data.

Paper
Add Code

Class Balanced Adaptive Pseudo Labeling for Federated Semi-Supervised Learning

no code implementations • CVPR 2023 • Ming Li, Qingli Li, Yan Wang

The second key element is that we design class balanced adaptive thresholds via considering the empirical distribution of all training data in local clients, to encourage a balanced training process.

Paper
Add Code

MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery

1 code implementation • CVPR 2023 • Duowen Chen, Yunhao Bai, Wei Shen, Qingli Li, Lequan Yu, Yan Wang

Our strategy encourages unlabeled images to learn organ semantics in relative locations from the labeled images (cross-branch) and enhances the learning ability for small organs (within-branch).

Anatomy Data Augmentation +4

Paper
Code

GeneFormer: Learned Gene Compression using Transformer-based Context Modeling

no code implementations • 16 Dec 2022 • Zhanbei Cui, Yu Liao, Tongda Xu, Yan Wang

Then, we propose fixed-length parallel grouping to accelerate the decoding speed of our autoregressive model.

Data Compression

Paper
Add Code

SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud

1 code implementation • 6 Dec 2022 • Yan Wang, Junbo Yin, Wei Li, Pascal Frossard, Ruigang Yang, Jianbing Shen

However, these UDA solutions just yield unsatisfactory 3D detection results when there is a severe domain shift, e. g., from Waymo (64-beam) to nuScenes (32-beam).

3D Object Detection Autonomous Driving +5

Paper
Code

A K-variate Time Series Is Worth K Words: Evolution of the Vanilla Transformer Architecture for Long-term Multivariate Time Series Forecasting

no code implementations • 6 Dec 2022 • Zanwei Zhou, RuiZhe Zhong, Chen Yang, Yan Wang, Xiaokang Yang, Wei Shen

In this study, we point out that the current tokenization strategy in MTSF Transformer architectures ignores the token uniformity inductive bias of Transformers.

Inductive Bias Multivariate Time Series Forecasting +1

Paper
Add Code

Generalizing Math Word Problem Solvers via Solution Diversification

1 code implementation • 1 Dec 2022 • Zhenwen Liang, Jipeng Zhang, Lei Wang, Yan Wang, Jie Shao, Xiangliang Zhang

In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator.

Math

Paper
Code

Meta Architecture for Point Cloud Analysis

1 code implementation • CVPR 2023 • Haojia Lin, Xiawu Zheng, Lijiang Li, Fei Chao, Shanshan Wang, Yan Wang, Yonghong Tian, Rongrong Ji

However, the lack of a unified framework to interpret those networks makes any systematic comparison, contrast, or analysis challenging, and practically limits healthy development of the field.

Ranked #2 on 3D Semantic Segmentation on OpenTrench3D

3D Semantic Segmentation

Paper
Code

ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression

no code implementations • 20 Nov 2022 • Yiqi Jin, Ziyu Zhu, Tongda Xu, Yuhuan Lin, Yan Wang

For octree-based point cloud compression, previous works show that the information of ancestor nodes and sibling nodes are equally important for predicting current node.

Paper
Add Code

Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters

1 code implementation • 13 Nov 2022 • Nuo Chen, Yan Wang, Haiyun Jiang, Deng Cai, Yuhan Li, Ziyang Chen, Longyue Wang, Jia Li

In this paper, we introduce the Harry Potter Dialogue (HPD) dataset, designed to advance the study of dialogue agents and character alignment.

Ranked #1 on Persona Dialogue in Story on Harry Potter Dialogue Dataset

Dialogue Generation In-Context Learning +2

Paper
Code

FedVMR: A New Federated Learning method for Video Moment Retrieval

no code implementations • 28 Oct 2022 • Yan Wang, Xin Luo, Zhen-Duo Chen, Peng-Fei Zhang, Meng Liu, Xin-Shun Xu

As the first that is explored in VMR field, the new task is defined as video moment retrieval with distributed data.

Federated Learning Moment Retrieval +1

Paper
Add Code

DPCNet: Dual Path Multi-Excitation Collaborative Network for Facial Expression Representation Learning in Videos

no code implementations • MM '22: Proceedings of the 30th ACM International Conference on Multimedia 2022 • Yan Wang, Yixuan Sun, Wei Song, Shuyong Gao, Yiwen Huang, Zhaoyu Chen, Weifeng Ge, and Wenqiang Zhang

To obtain consistent prediction probabilities from the dual path, we further propose a dual path regularization loss, aiming to minimize the divergence between the distributions of two-path embeddings.

Ranked #13 on Dynamic Facial Expression Recognition on DFEW

Dynamic Facial Expression Recognition Representation Learning

Paper
Add Code

Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers

2 code implementations • 5 Oct 2022 • Yan Wang, Gautham Vasan, A. Rupam Mahmood

A common setup for a robotic agent is to have two different computers simultaneously: a resource-limited local computer tethered to the robot and a powerful remote computer connected wirelessly.

Reinforcement Learning (RL)

Paper
Code

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem

1 code implementation • 2 Oct 2022 • Xingyu Chen, Ruonan Zhang, Ji Jiang, Yan Wang, Ge Li, Thomas H. Li

In this paper, we redesign the patch-based triplet loss in MDE to alleviate the ubiquitous edge-fattening issue.

Ranked #1 on Unsupervised Monocular Depth Estimation on Kitti Raw

Depth Prediction Metric Learning +2

Paper
Code

Correcting the Sub-optimal Bit Allocation

no code implementations • 29 Sep 2022 • Tongda Xu, Han Gao, Yuanyuan Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

In this paper, we investigate the problem of bit allocation in Neural Video Compression (NVC).

Variational Inference Video Compression

Paper
Add Code

Spatial Moment Pooling Improves Neural Image Assessment

no code implementations • 29 Sep 2022 • Tongda Xu, Yifan Shao, Yan Wang, Hongwei Qin

In recent years, there has been widespread attention drawn to convolutional neural network (CNN) based blind image quality assessment (IQA).

Blind Image Quality Assessment

Paper
Add Code

Multi-Sample Training for Neural Image Compression

no code implementations • 28 Sep 2022 • Tongda Xu, Yan Wang, Dailan He, Chenjian Gao, Han Gao, Kunzan Liu, Hongwei Qin

This paper considers the problem of lossy neural image compression (NIC).

Image Compression Quantization

Paper
Add Code

Multi-scale Attention Network for Single Image Super-Resolution

1 code implementation • 28 Sep 2022 • Yan Wang, Yusen Li, Gang Wang, Xiaoguang Liu

ConvNets can compete with transformers in high-level tasks by exploiting larger receptive fields.

Blocking Image Super-Resolution +1

Paper
Code

HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification

1 code implementation • 21 Sep 2022 • Xiangzuo Huo, Gang Sun, Shengwei Tian, Yan Wang, Long Yu, Jun Long, Wendong Zhang, Aolun Li

A parallel hierarchy of local and global feature blocks is designed to efficiently extract local features and global representations at various semantic scales, with the flexibility to model at different scales and linear computational complexity relevant to image size.

Image Classification Inductive Bias +1

153

Paper
Code

Bit Allocation using Optimization

1 code implementation • 20 Sep 2022 • Tongda Xu, Han Gao, Chenjian Gao, Yuanyuan Wang, Dailan He, Jinyong Pi, Jixiang Luo, Ziyu Zhu, Mao Ye, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

In this paper, we consider the problem of bit allocation in Neural Video Compression (NVC).

Variational Inference Video Compression

Paper
Code

Flexible Neural Image Compression via Code Editing

no code implementations • 19 Sep 2022 • Chenjian Gao, Tongda Xu, Dailan He, Hongwei Qin, Yan Wang

Neural image compression (NIC) has outperformed traditional image codecs in rate-distortion (R-D) performance.

Image Compression Quantization

Paper
Add Code

S$^3$R: Self-supervised Spectral Regression for Hyperspectral Histopathology Image Classification

no code implementations • 19 Sep 2022 • Xingran Xie, Yan Wang, Qingli Li

More concretely, we propose to learn a set of linear coefficients that can be used to represent one band by the remaining bands via masking out these bands.

Contrastive Learning Image Classification +1

Paper
Add Code

RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN

1 code implementation • 22 Aug 2022 • Huy Phan, Cong Shi, Yi Xie, Tianfang Zhang, Zhuohang Li, Tianming Zhao, Jian Liu, Yan Wang, Yingying Chen, Bo Yuan

Recently backdoor attack has become an emerging threat to the security of deep neural network (DNN) models.

Backdoor Attack

Paper
Code

Trustworthy Recommender Systems

no code implementations • 10 Aug 2022 • Shoujin Wang, Xiuzhen Zhang, Yan Wang, Huan Liu, Francesco Ricci

However, researchers lack a systematic overview and discussion of the literature in this novel and fast developing field of TRSs.

Recommendation Systems

Paper
Add Code

Stochastic MPC with Dual Control for Autonomous Driving with Multi-Modal Interaction-Aware Predictions

no code implementations • 6 Aug 2022 • Siddharth H. Nair, Vijay Govindarajan, Theresa Lin, Yan Wang, Eric H. Tseng, Francesco Borrelli

The proposed approach is demonstrated on a longitudinal control example, with uncertainties in predictions of the autonomous and surrounding vehicles.

Autonomous Driving

Paper
Add Code

Effidit: Your AI Writing Assistant

no code implementations • 3 Aug 2022 • Shuming Shi, Enbo Zhao, Duyu Tang, Yan Wang, Piji Li, Wei Bi, Haiyun Jiang, Guoping Huang, Leyang Cui, Xinting Huang, Cong Zhou, Yong Dai, Dongyang Ma

In Effidit, we significantly expand the capacities of a writing assistant by providing functions in five categories: text completion, error checking, text polishing, keywords to sentences (K2S), and cloud input methods (cloud IME).

Keywords to Sentences Retrieval +3

Paper
Add Code

Ithaca365: Dataset and Driving Perception under Repeated and Challenging Weather Conditions

no code implementations • CVPR 2022 • Carlos A. Diaz-Ruiz, Youya Xia, Yurong You, Jose Nino, Junan Chen, Josephine Monica, Xiangyu Chen, Katie Luo, Yan Wang, Marc Emond, Wei-Lun Chao, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

Advances in perception for self-driving cars have accelerated in recent years due to the availability of large-scale datasets, typically collected at specific locations and under nice weather conditions.

3D Object Detection Anomaly Detection +7

Paper
Add Code

Evaluating the Practicality of Learned Image Compression

no code implementations • 29 Jul 2022 • Hongjiu Yu, Qiancheng Sun, Jin Hu, Xingyuan Xue, Jixiang Luo, Dailan He, Yilong Li, Pengbo Wang, Yuanyuan Wang, Yaxu Dai, Yan Wang, Hongwei Qin

On CPU, the latency of our implementation is comparable with JPEG XL.

Image Compression MS-SSIM +3

Paper
Add Code

Beyond CNNs: Exploiting Further Inherent Symmetries in Medical Image Segmentation

no code implementations • 29 Jul 2022 • Shuchao Pang, Anan Du, Mehmet A. Orgun, Yan Wang, Quan Z. Sheng, Shoujin Wang, Xiaoshui Huang, Zhenmei Yu

Automatic tumor or lesion segmentation is a crucial step in medical image analysis for computer-aided diagnosis.

Image Segmentation Lesion Segmentation +3

Paper
Add Code

Weakly Supervised Video Salient Object Detection via Point Supervision

no code implementations • 15 Jul 2022 • Shuyong Gao, Haozhe Xing, Wei zhang, Yan Wang, Qianyu Guo, Wenqiang Zhang

Several works attempt to use scribble annotations to mitigate this problem, but point supervision as a more labor-saving annotation method (even the most labor-saving method among manual annotation methods for dense prediction), has not been explored.

Object object-detection +3

Paper
Add Code

Few-Shot Semantic Relation Prediction across Heterogeneous Graphs

no code implementations • 11 Jul 2022 • Pengfei Ding, Yan Wang, Guanfeng Liu, Xiaofang Zhou

In real-world scenarios, new semantic relations constantly emerge and they typically appear with only a few labeled data.

Meta-Learning Relation

Paper
Add Code

SHREC'22 Track: Sketch-Based 3D Shape Retrieval in the Wild

no code implementations • 11 Jul 2022 • Jie Qin, Shuaihang Yuan, Jiaxin Chen, Boulbaba Ben Amor, Yi Fang, Nhat Hoang-Xuan, Chi-Bien Chu, Khoi-Nguyen Nguyen-Ngoc, Thien-Tri Cao, Nhat-Khang Ngo, Tuan-Luc Huynh, Hai-Dang Nguyen, Minh-Triet Tran, Haoyang Luo, Jianning Wang, Zheng Zhang, Zihao Xin, Yang Wang, Feng Wang, Ying Tang, Haiqin Chen, Yan Wang, Qunying Zhou, Ji Zhang, Hongyuan Wang

We define two SBSR tasks and construct two benchmarks consisting of more than 46, 000 CAD models, 1, 700 realistic models, and 145, 000 sketches in total.

3D Object Retrieval 3D Shape Retrieval +1

Paper
Add Code

Multi-agent systems with CBF-based controllers -- collision avoidance and liveness from instability

no code implementations • 11 Jul 2022 • Mrdjan Jankovic, Mario Santillo, Yan Wang

Monte Carlo simulations show that decentralized, host-only control policies and CCS lack liveness while the PCCA policy performs as well as the Centralized.

Collision Avoidance

Paper
Add Code

Unsupervised Domain Adaptation through Shape Modeling for Medical Image Segmentation

1 code implementation • 6 Jul 2022 • Yuan YAO, Fengze Liu, Zongwei Zhou, Yan Wang, Wei Shen, Alan Yuille, Yongyi Lu

Previous methods proposed Variational Autoencoder (VAE) based models to learn the distribution of shape for a particular organ and used it to automatically evaluate the quality of a segmentation prediction by fitting it into the learned shape distribution.

Image Segmentation Pancreas Segmentation +3

Paper
Code

Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera

1 code implementation • 30 Jun 2022 • Hongrui Cai, Wanquan Feng, Xuetao Feng, Yan Wang, Juyong Zhang

We propose Neural-DynamicReconstruction (NDR), a template-free method to recover high-fidelity geometry and motions of a dynamic scene from a monocular RGB-D camera.

Dynamic Reconstruction Monocular Reconstruction +2

515

Paper
Code

Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated Information

no code implementations • 21 Jun 2022 • Yuanjie Yan, Suorong Yang, Yan Wang, Jian Zhao, Furao Shen

From the perspective of this framework, we review those subtasks and give a unified interpretation of various scenarios.

Image-to-Image Translation Semantic Segmentation +2

Paper
Add Code

Automatic Prosody Annotation with Pre-Trained Text-Speech Model

1 code implementation • 16 Jun 2022 • Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu

Prosodic boundary plays an important role in text-to-speech synthesis (TTS) in terms of naturalness and readability.

Speech Synthesis Text-To-Speech Synthesis

107

Paper
Code

PO-ELIC: Perception-Oriented Efficient Learned Image Coding

1 code implementation • 28 May 2022 • Dailan He, Ziming Yang, Hongjiu Yu, Tongda Xu, Jixiang Luo, Yuan Chen, Chenjian Gao, Xinjie Shi, Hongwei Qin, Yan Wang

In the past years, learned image compression (LIC) has achieved remarkable performance.

Image Compression MS-SSIM +1

Paper
Code

Sequential/Session-based Recommendations: Challenges, Approaches, Applications and Opportunities

no code implementations • 22 May 2022 • Shoujin Wang, Qi Zhang, Liang Hu, Xiuzhen Zhang, Yan Wang, Charu Aggarwal

In recent years, sequential recommender systems (SRSs) and session-based recommender systems (SBRSs) have emerged as a new paradigm of RSs to capture users' short-term but dynamic preferences for enabling more timely and accurate recommendations.

Session-Based Recommendations

Paper
Add Code

A Correlation Information-based Spatiotemporal Network for Traffic Flow Forecasting

2 code implementations • 20 May 2022 • Weiguo Zhu, Yongqi Sun, Xintong Yi, Yan Wang

In this paper, based on the maximal information coefficient, we present two elaborate spatiotemporal representations, spatial correlation information (SCorr) and temporal correlation information (TCorr).

Ranked #1 on Traffic Prediction on HZME(outflow)

Traffic Prediction

Paper
Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

117

Paper
Code

Effectively Using Long and Short Sessions for Multi-Session-based Recommendations

no code implementations • 9 May 2022 • Zihan Wang, Gang Wu, Yan Wang

The RNN often used in previous work is not suitable to process short sessions, because RNN only focuses on the sequential relationship, which we find is not the only relationship between items in short sessions.

Session-Based Recommendations

Paper
Add Code

Language Models Can See: Plugging Visual Controls in Text Generation

1 code implementation • 5 May 2022 • Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier

MAGIC is a flexible framework and is theoretically compatible with any text generation tasks that incorporate image grounding.

Image Captioning Image-text matching +3

250

Paper
Code

Edge-enhanced Feature Distillation Network for Efficient Super-Resolution

1 code implementation • 19 Apr 2022 • Yan Wang

With the recently massive development in convolution neural networks, numerous lightweight CNN-based image super-resolution methods have been proposed for practical deployments on edge devices.

Image Super-Resolution

Paper
Code

Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks

1 code implementation • 16 Apr 2022 • Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji

Despite the exciting performance, Transformer is criticized for its excessive parameters and computation cost.

Image Classification

Paper
Code

Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain

no code implementations • CVPR 2022 • Lina Guo, Xinjie Shi, Dailan He, Yuanyuan Wang, Rui Ma, Hongwei Qin, Yan Wang

JPEG is a popular image compression method widely used by individuals, data center, cloud storage and network filesystems.

Image Compression

Paper
Add Code

Adaptive Frequency Learning in Two-branch Face Forgery Detection

no code implementations • 27 Mar 2022 • Neng Wang, Yang Bai, Kun Yu, Yong Jiang, Shu-Tao Xia, Yan Wang

Face forgery has attracted increasing attention in recent applications of computer vision.

Vocal Bursts Valence Prediction

Paper
Add Code

Weakly-Supervised Salient Object Detection Using Point Supervision

1 code implementation • 22 Mar 2022 • Shuyong Gao, Wei zhang, Yan Wang, Qianyu Guo, Chenglong Zhang, Yangji He, Wenqiang Zhang

Then we develop a transformer-based point-supervised saliency detection model to produce the first round of saliency maps.

Object object-detection +3

Paper
Code

ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding

5 code implementations • CVPR 2022 • Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, Yan Wang

Recently, learned image compression techniques have achieved remarkable performance, even surpassing the best manually designed lossy image coders.

Ranked #1 on Image Compression on kodak

Image Compression

Paper
Code

ContrastMask: Contrastive Learning to Segment Every Thing

1 code implementation • CVPR 2022 • Xuehui Wang, Kai Zhao, Ruixin Zhang, Shouhong Ding, Yan Wang, Wei Shen

In this framework, annotated masks of seen categories and pseudo masks of unseen categories serve as a prior for contrastive learning, where features from the mask regions (foreground) are pulled together, and are contrasted against those from the background, and vice versa.

Instance Segmentation Segmentation +1

Paper
Code

FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos

no code implementations • CVPR 2022 • Yan Wang, Yixuan Sun, Yiwen Huang, Zhongying Liu, Shuyong Gao, Wei zhang, Weifeng Ge, Wenqiang Zhang

Current benchmarks for facial expression recognition (FER) mainly focus on static images, while there are limited datasets for FER in videos.

4k Facial Expression Recognition +1

Paper
Add Code

Multi-View Fusion Transformer for Sensor-Based Human Activity Recognition

no code implementations • 16 Feb 2022 • Yimu Wang, Kun Yu, Yan Wang, Hui Xue

In this paper, to extract a better feature for advancing the performance, we propose a novel method, namely multi-view fusion transformer (MVFT) along with a novel attention mechanism.

Human Activity Recognition Time Series +1

Paper
Add Code

Post-Training Quantization for Cross-Platform Learned Image Compression

no code implementations • 15 Feb 2022 • Dailan He, Ziming Yang, Yuan Chen, Qi Zhang, Hongwei Qin, Yan Wang

It has been witnessed that learned image compression has outperformed conventional image coding techniques and tends to be practical in industrial applications.

Image Compression Quantization

Paper
Add Code

A Contrastive Framework for Neural Text Generation

2 code implementations • 13 Feb 2022 • Yixuan Su, Tian Lan, Yan Wang, Dani Yogatama, Lingpeng Kong, Nigel Collier

Text generation is of great importance to many natural language processing applications.

Text Generation

444

Paper
Code

A Survey on Retrieval-Augmented Text Generation

no code implementations • 2 Feb 2022 • Huayang Li, Yixuan Su, Deng Cai, Yan Wang, Lemao Liu

Recently, retrieval-augmented text generation attracted increasing attention of the computational linguistics community.

Machine Translation Response Generation +3

Paper
Add Code

Approximation of Images via Generalized Higher Order Singular Value Decomposition over Finite-dimensional Commutative Semisimple Algebra

1 code implementation • 1 Feb 2022 • Liang Liao, Sen Lin, Lun Li, Xiuwei Zhang, Song Zhao, Yan Wang, Xinqiang Wang, Qi Gao, Jingyu Wang

Higher order singular value decomposition (HOSVD) extends the SVD and can approximate higher order data using sums of a few rank-one components.

Paper
Code

Protum: A New Method For Prompt Tuning Based on "[MASK]"

no code implementations • 28 Jan 2022 • Pan He, Yuxi Chen, Yan Wang, Yanru Zhang

In response to the above issue, we propose a new \textbf{Pro}mpt \textbf{Tu}ning based on "[\textbf{M}ASK]" (\textbf{Protum}) method in this paper, which constructs a classification task through the information carried by the hidden layer of "[MASK]" tokens and then predicts the labels directly rather than the answer tokens.

Language Modelling

Paper
Add Code

External Attention Assisted Multi-Phase Splenic Vascular Injury Segmentation with Limited Data

no code implementations • 4 Jan 2022 • Yuyin Zhou, David Dreizin, Yan Wang, Fengze Liu, Wei Shen, Alan L. Yuille

The spleen is one of the most commonly injured solid organs in blunt abdominal trauma.

Segmentation

Paper
Add Code

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

no code implementations • 24 Dec 2021 • Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang

In the second stage, we design a mesh refinement transformer (MRT) to respectively refine each coarse reconstruction result via a self-attention mechanism.

Ranked #64 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

3D human pose and shape estimation 3D Reconstruction

Paper
Add Code

Intriguing Findings of Frequency Selection for Image Deblurring

2 code implementations • 23 Nov 2021 • Xintian Mao, Yiming Liu, Fengze Liu, Qingli Li, Wei Shen, Yan Wang

Blur was naturally analyzed in the frequency domain, by estimating the latent sharp image and the blur kernel given a blurry image.

Ranked #2 on Deblurring on RealBlur-R

Deblurring Image Deblurring +1

211

Paper
Code

Training Neural Networks for Solving 1-D Optimal Piecewise Linear Approximation

no code implementations • 14 Oct 2021 • Hangcheng Dong, Jingxiao Liao, Yan Wang, Yixin Chen, Bingguo Liu, Dong Ye, Guodong Liu

Our main contributions are that we propose the theorems to characterize the optimal solution of the PWLA problem and present the LNN method for solving it.

Paper
Add Code

Exploring Dense Retrieval for Dialogue Response Selection

1 code implementation • 13 Oct 2021 • Tian Lan, Deng Cai, Yan Wang, Yixuan Su, Heyan Huang, Xian-Ling Mao

In this study, we present a solution to directly select proper responses from a large corpus or even a nonparallel corpus that only consists of unpaired sentences, using a dense retrieval model.

Conversational Response Selection Retrieval

Paper
Code

WDCCNet: Weighted Double-Classifier Constraint Neural Network for Mammographic Image Classification

no code implementations • IEEE Transactions on Medical Imaging 2021 • Yan Wang, Zizhou Wang, Yangqin Feng, Lei Zhang

Our proposed method can be easily applied to an existing convolutional neural network to improve mammographic image classification performance.

Ranked #1 on Suspicous (BIRADS 4,5)-no suspicous (BIRADS 1,2,3) per image classification on InBreast

Image Classification Suspicous (BIRADS 4,5)-no suspicous (BIRADS 1,2,3) per image classification

Paper
Add Code

HLIC: Harmonizing Optimization Metrics in Learned Image Compression by Reinforcement Learning

no code implementations • 30 Sep 2021 • Baocheng Sun, Meng Gu, Dailan He, Tongda Xu, Yan Wang, Hongwei Qin

Learned image compression is making good progress in recent years.

Image Compression MS-SSIM +3

Paper
Add Code

Post-Training Quantization Is All You Need to Perform Cross-Platform Learned Image Compression

no code implementations • 29 Sep 2021 • Dailan He, Ziming Yang, Yan Wang, Yuan Chen, Qi Zhang, Hongwei Qin

It has been witnessed that learned image compression has outperformed conventional image coding techniques and tends to be practical in industrial applications.

Image Compression Quantization

Paper
Add Code

Fixed Neural Network Steganography: Train the images, not the network

1 code implementation • ICLR 2022 • Varsha Kishore, Xiangyu Chen, Yan Wang, Boyi Li, Kilian Q Weinberger

Recent attempts at image steganography make use of advances in deep learning to train an encoder-decoder network pair to hide and retrieve secret messages in images.

Image Steganography Steganalysis

Paper
Code

Transductive Learning for Unsupervised Text Style Transfer

1 code implementation • EMNLP 2021 • Fei Xiao, Liang Pang, Yanyan Lan, Yan Wang, HuaWei Shen, Xueqi Cheng

The proposed transductive learning approach is general and effective to the task of unsupervised style transfer, and we will apply it to the other two typical methods in the future.

Retrieval Style Transfer +3

Paper
Code

MHFC: Multi-Head Feature Collaboration for Few-Shot Learning

no code implementations • 16 Sep 2021 • Shuai Shao, Lei Xing, Yan Wang, Rui Xu, Chunyan Zhao, Yan-Jiang Wang, Bao-Di Liu

Apply the trained FEM to acquire the novel data's features and recognize them.

Few-Shot Learning

Paper
Add Code

OMPQ: Orthogonal Mixed Precision Quantization

1 code implementation • 16 Sep 2021 • Yuexiao Ma, Taisong Jin, Xiawu Zheng, Yan Wang, Huixia Li, Yongjian Wu, Guannan Jiang, Wei zhang, Rongrong Ji

Instead of solving a problem of the original integer programming, we propose to optimize a proxy metric, the concept of network orthogonality, which is highly correlated with the loss of the integer programming but also easy to optimize with linear programming.

AutoML Quantization

Paper
Code

Stylistic Retrieval-based Dialogue System with Unparallel Training Data

no code implementations • 12 Sep 2021 • Hao Fu, Yan Wang, Ruihua Song, Tianran Hu, Jianyun Nie

The ability of a dialog system to express consistent language style during conversations has a direct, positive impact on its usability and on user satisfaction.

Chatbot Data Augmentation +2

Paper
Add Code

RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

1 code implementation • 11 Sep 2021 • Shiyu Tang, Ruihao Gong, Yan Wang, Aishan Liu, Jiakai Wang, Xinyun Chen, Fengwei Yu, Xianglong Liu, Dawn Song, Alan Yuille, Philip H. S. Torr, DaCheng Tao

Thus, we propose RobustART, the first comprehensive Robustness investigation benchmark on ImageNet regarding ARchitecture design (49 human-designed off-the-shelf architectures and 1200+ networks from neural architecture search) and Training techniques (10+ techniques, e. g., data augmentation) towards diverse noises (adversarial, natural, and system noises).

Adversarial Robustness Benchmarking +2

143

Paper
Code

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

1 code implementation • 2 Sep 2021 • Yihuai Lan, Lei Wang, Qiyuan Zhang, Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim

Over the last few years, there are a growing number of datasets and deep learning-based methods proposed for effectively solving MWPs.

Ranked #8 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving

155

Paper
Code

Real World Robustness from Systematic Noise

no code implementations • 2 Sep 2021 • Yan Wang, Yuhang Li, Ruihao Gong

Systematic error, which is not determined by chance, often refers to the inaccuracy (involving either the observation or measurement process) inherent to a system.

Paper
Add Code

ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding

no code implementations • 30 Aug 2021 • Lingyun Feng, Jianwei Yu, Deng Cai, Songxiang Liu, Haitao Zheng, Yan Wang

%To facilitate the research on ASR-robust general language understanding, In this paper, we propose ASR-GLUE benchmark, a new collection of 6 different NLU tasks for evaluating the performance of models under ASR error across 3 different levels of background noise and 6 speakers with various voice characteristics.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

A Unified Framework for Cross-Domain and Cross-System Recommendations

no code implementations • 18 Aug 2021 • Feng Zhu, Yan Wang, Jun Zhou, Chaochao Chen, Longfei Li, Guanfeng Liu

Moreover, to avoid negative transfer, we further propose a Personalized training strategy to minimize the embedding difference of common entities between a richer dataset and a sparser dataset, deriving three new models, i. e., GA-DTCDR-P, GA-MTCDR-P, and GA-CDR+CSR-P, for the three scenarios respectively.

Graph Embedding

Paper
Add Code

Next-item Recommendations in Short Sessions

no code implementations • 15 Jul 2021 • Wenzhuo Song, Shoujin Wang, Yan Wang, Shengsheng Wang

The obtained similar sessions are then utilized to complement and optimize the preference representation learned from the current short session by the local module for more accurate next-item recommendations in this short session.

Few-Shot Learning Recommendation Systems +1

Paper
Add Code

BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data

1 code implementation • ACL 2021 • Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang, Ting Liu

Maintaining consistent personas is essential for dialogue agents.

Dialogue Generation Response Generation

134

Paper
Code

The Medical Segmentation Decathlon

1 code implementation • 10 Jun 2021 • Michela Antonelli, Annika Reinke, Spyridon Bakas, Keyvan Farahani, AnnetteKopp-Schneider, Bennett A. Landman, Geert Litjens, Bjoern Menze, Olaf Ronneberger, Ronald M. Summers, Bram van Ginneken, Michel Bilello, Patrick Bilic, Patrick F. Christ, Richard K. G. Do, Marc J. Gollub, Stephan H. Heckers, William R. Jarnagin, Maureen K. McHugo, Sandy Napel, Jennifer S. Goli Pernicka, Kawal Rhode, Catalina Tobon-Gomez, Eugene Vorontsov, Henkjan Huisman, James A. Meakin, Sebastien Ourselin, Manuel Wiesenfarth, Pablo Arbelaez, Byeonguk Bae, Sihong Chen, Laura Daza, Jianjiang Feng, Baochun He, Fabian Isensee, Yuanfeng Ji, Fucang Jia, Namkug Kim, Ildoo Kim, Dorit Merhof, Akshay Pai, Beomhee Park, Mathias Perslev, Ramin Rezaiifar, Oliver Rippel, Ignacio Sarasua, Wei Shen, Jaemin Son, Christian Wachinger, Liansheng Wang, Yan Wang, Yingda Xia, Daguang Xu, Zhanwei Xu, Yefeng Zheng, Amber L. Simpson, Lena Maier-Hein, M. Jorge Cardoso

Segmentation is so far the most widely investigated medical image processing task, but the various segmentation challenges have typically been organized in isolation, such that algorithm development was driven by the need to tackle a single specific clinical problem.

Image Segmentation Segmentation +1

Paper
Code

You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient

1 code implementation • 4 Jun 2021 • Shaokun Zhang, Xiawu Zheng, Chenyi Yang, Yuchao Li, Yan Wang, Fei Chao, Mengdi Wang, Shen Li, Jun Yang, Rongrong Ji

Motivated by the necessity of efficient inference across various constraints on BERT, we propose a novel approach, YOCO-BERT, to achieve compress once and deploy everywhere.

AutoML Model Compression

Paper
Code

Learning Inductive Attention Guidance for Partially Supervised Pancreatic Ductal Adenocarcinoma Prediction

no code implementations • 31 May 2021 • Yan Wang, Peng Tang, Yuyin Zhou, Wei Shen, Elliot K. Fishman, Alan L. Yuille

We instantiate both the global and the local classifiers by multiple instance learning (MIL), where the attention guidance, indicating roughly where the PDAC regions are, is the key to bridging them: For global MIL based normal/PDAC classification, attention serves as a weight for each instance (voxel) during MIL pooling, which eliminates the distraction from the background; For local MIL based semi-supervised PDAC segmentation, the attention guidance is inductive, which not only provides bag-level pseudo-labels to training data without per-voxel annotations for MIL training, but also acts as a proxy of an instance-level classifier.

Multiple Instance Learning Segmentation

Paper
Add Code

Neural Machine Translation with Monolingual Translation Memory

1 code implementation • ACL 2021 • Deng Cai, Yan Wang, Huayang Li, Wai Lam, Lemao Liu

Second, the memory retriever and NMT model can be jointly optimized for the ultimate translation goal.

Domain Adaptation Machine Translation +3

Paper
Code

Optimal Estimator Design and Properties Analysis for Interconnected Systems with Asymmetric Information Structure

no code implementations • 21 May 2021 • Yan Wang, Junlin Xiong, Zaiyue Yang, Rong Su

We found that there exists a critical probability such that the EEC is bounded if the delay probability is below the critical probability.

Paper
Add Code

Graph Learning based Recommender Systems: A Review

1 code implementation • 13 May 2021 • Shoujin Wang, Liang Hu, Yan Wang, Xiangnan He, Quan Z. Sheng, Mehmet A. Orgun, Longbing Cao, Francesco Ricci, Philip S. Yu

Recent years have witnessed the fast development of the emerging topic of Graph Learning based Recommender Systems (GLRS).

Collaborative Filtering Graph Learning +1

Paper
Code

Deep Learning based Multi-modal Computing with Feature Disentanglement for MRI Image Synthesis

no code implementations • 6 May 2021 • Yuchen Fei, Bo Zhan, Mei Hong, Xi Wu, Jiliu Zhou, Yan Wang

To take full advantage of the complementary information provided by different modalities, multi-modal MRI sequences are utilized as input.

Disentanglement Image Generation

Paper
Add Code

Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack

no code implementations • NeurIPS 2021 • Yixu Wang, Jie Li, Hong Liu, Yan Wang, Yongjian Wu, Feiyue Huang, Rongrong Ji

We argue this is due to the lack of rich information in the probability prediction and the overfitting caused by hard labels.

Self-Knowledge Distillation

Paper
Add Code

ISTR: End-to-End Instance Segmentation with Transformers

1 code implementation • 3 May 2021 • Jie Hu, Liujuan Cao, Yao Lu, Shengchuan Zhang, Yan Wang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji

However, such an upgrade is not applicable to instance segmentation, due to its significantly higher output dimensions compared to object detection.

Ranked #21 on Instance Segmentation on COCO test-dev

Instance Segmentation object-detection +3

200

Paper
Code

Distributed Eco-Driving Algorithm of Vehicle Platoon Using Traffic Light and Road Slope Information

no code implementations • 26 Apr 2021 • Yan Wang, Rong Su, Wei Wang, Xiaoxu Liu, Bohui Wang

This paper investigates the problem of ecological driving (eco-driving) of vehicle platoons.

Paper
Add Code

Sketch and Customize: A Counterfactual Story Generator

1 code implementation • 2 Apr 2021 • Changying Hao, Liang Pang, Yanyan Lan, Yan Wang, Jiafeng Guo, Xueqi Cheng

In the sketch stage, a skeleton is extracted by removing words which are conflict to the counterfactual condition, from the original ending.

counterfactual Text Generation

Paper
Code

Checkerboard Context Model for Efficient Learned Image Compression

3 code implementations • CVPR 2021 • Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin

To the best of our knowledge, this is the first exploration on parallelization-friendly spatial context model for learned image compression.

Computational Efficiency Image Compression

Paper
Code

Exploiting Playbacks in Unsupervised Domain Adaptation for 3D Object Detection

no code implementations • 26 Mar 2021 • Yurong You, Carlos Andres Diaz-Ruiz, Yan Wang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q Weinberger

Self-driving cars must detect other vehicles and pedestrians in 3D to plan safe routes and avoid collisions.

3D Object Detection Autonomous Driving +3

Paper
Add Code

Distilling a Powerful Student Model via Online Knowledge Distillation

1 code implementation • 26 Mar 2021 • Shaojie Li, Mingbao Lin, Yan Wang, Yongjian Wu, Yonghong Tian, Ling Shao, Rongrong Ji

Besides, a self-distillation module is adopted to convert the feature map of deeper layers into a shallower one.

Knowledge Distillation

Paper
Code

SpecTr: Spectral Transformer for Hyperspectral Pathology Image Segmentation

1 code implementation • 5 Mar 2021 • Boxiang Yun, Yan Wang, Jieneng Chen, Huiyu Wang, Wei Shen, Qingli Li

Hyperspectral imaging (HSI) unlocks the huge potential to a wide variety of applications relied on high-precision pathology image segmentation, such as computational pathology and precision medicine.

Image Segmentation Segmentation +1

Paper
Code

Cross-Domain Recommendation: Challenges, Progress, and Prospects

no code implementations • 2 Mar 2021 • Feng Zhu, Yan Wang, Chaochao Chen, Jun Zhou, Longfei Li, Guanfeng Liu

To address the long-standing data sparsity problem in recommender systems (RSs), cross-domain recommendation (CDR) has been proposed to leverage the relatively richer information from a richer domain to improve the recommendation performance in a sparser domain.

Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.