no code implementations • ECCV 2020 • Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Heng Huang, Xinbo Gao
Lighter model and faster inference are the focus of current single image super-resolution (SISR) research.
no code implementations • 15 Apr 2025 • Hongbo Li, Shangchao Yang, Ruiyang Xia, Lin Yuan, Xinbo Gao
As deepfake technologies continue to advance, passive detection methods struggle to generalize with various forgery manipulations and datasets.
no code implementations • 13 Apr 2025 • Jiahua Xu, Dawei Zhou, Lei Hu, Zaiyi Liu, Nannan Wang, Xinbo Gao
Multimodal medical images play a crucial role in the precise and comprehensive clinical diagnosis.
no code implementations • 11 Apr 2025 • Dawei Zhou, Suzhi Gang, Decheng Liu, Tongliang Liu, Nannan Wang, Xinbo Gao
The generated adversarial noise can actively interfere with the malicious manipulation model by triggering knowledge-guided and perception-related disruptions in the fake samples.
no code implementations • 28 Mar 2025 • Yang Liu, Feixiang Liu, Jiale Du, Xinbo Gao, Jungong Han
Our UMMEC method significantly improves classification performance with minimal labeled data, advancing the state-of-the-art in TFSL.
no code implementations • 28 Mar 2025 • Yang Liu, Xun Zhang, Jiale Du, Xinbo Gao, Jungong Han
Zero-shot Learning(ZSL) attains knowledge transfer from seen classes to unseen classes by exploring auxiliary category information, which is a promising yet difficult research topic.
1 code implementation • 10 Mar 2025 • Junyan Lin, Feng Gap, Lin Qi, Junyu Dong, Qian Du, Xinbo Gao
To address these limitations, we propose a novel Dynamic Cross-Modal Feature Interaction Network (DCMNet), the first framework leveraging a dynamic routing mechanism for HSI and LiDAR classification.
1 code implementation • 30 Jan 2025 • Shuyin Xia, Xiaoyu Lian, Binbin Sang, Guoyin Wang, Xinbo Gao
Fuzzy rough set theory is effective for processing datasets with complex attributes, supported by a solid mathematical foundation and closely linked to kernel methods in machine learning.
no code implementations • 21 Jan 2025 • Bo Hu, Wei Wang, Chunyi Li, Lihuo He, Leida Li, Xinbo Gao
Wide-angle video is favored for its wide viewing angle and ability to capture a large area of scenery, making it an ideal choice for sports and adventure recording.
1 code implementation • 17 Jan 2025 • Xi Yang, Haoyuan Shi, Zihan Wang, Nannan Wang, Xinbo Gao
To address this, we propose the CNN-Swin Hybrid Network (CSHNet), which combines two key modules: Swin Embedded CNN (SEC) and CNN Embedded Swin (CES), forming the SEC-CES-Bottleneck (SCB).
no code implementations • 8 Jan 2025 • Lin Yuan, Kai Liang, Xiong Li, Tao Wu, Nannan Wang, Xinbo Gao
However, many still face limitations in visual quality and often overlook the potential to recover the original face from the anonymized version, which can be valuable in specific contexts such as image forensics.
1 code implementation • 13 Dec 2024 • Kaifan Zhang, Lihuo He, Xin Jiang, Wen Lu, Di Wang, Xinbo Gao
This results in the loss of critical multimodal information in EEG.
1 code implementation • 10 Dec 2024 • Jiahua Xu, Dawei Zhou, Lei Hu, Jianfeng Guo, Feng Yang, Zaiyi Liu, Nannan Wang, Xinbo Gao
Motion artifacts present in magnetic resonance imaging (MRI) can seriously interfere with clinical diagnosis.
1 code implementation • 7 Dec 2024 • Yan Zhang, Pengcheng Zheng, Chengxiao Zeng, Bin Xiao, Zhenghao Li, Xinbo Gao
In the deblurring branch, we design a pixel-adjustable kernel block (PAKB) to estimate the local and spatial-varying blur kernels.
no code implementations • 25 Nov 2024 • Jili Xia, Lihuo He, Fei Gao, Kaifan Zhang, Leida Li, Xinbo Gao
Recently, AI-generated images (AIGIs) created by given prompts (initial prompts) have garnered widespread attention.
no code implementations • 23 Nov 2024 • De Cheng, Yue Lu, Lingfeng He, Shizhou Zhang, Xi Yang, Nannan Wang, Xinbo Gao
Continual Learning (CL) aims to equip AI models with the ability to learn a sequence of tasks over time, without forgetting previously learned knowledge.
1 code implementation • 6 Nov 2024 • Xi Yang, Xu Gu, Xingyilang Yin, Xinbo Gao
Thus, we present ScanNetV2-INS with complete ground truth labels and supplement additional instances for 3D class-agnostic instance segmentation.
1 code implementation • 31 Oct 2024 • Ke Li, Fuyu Dong, Di Wang, Shaofeng Li, Quan Wang, Xinbo Gao, Tat-Seng Chua
Furthermore, we present VisTA, a simple yet effective baseline method that unifies the tasks of question answering and grounding by delivering both visual and textual answers.
no code implementations • 17 Oct 2024 • Shuyin Xia, Bolun Shi, Yifan Wang, Jiang Xie, Guoyin Wang, Xinbo Gao
Traditional clustering algorithms often focus on the most fine-grained information and achieve clustering by calculating the distance between each pair of data points or implementing other calculations based on points.
1 code implementation • 29 Sep 2024 • Kun Cheng, Lei Yu, Zhijun Tu, Xiao He, Liyu Chen, Yong Guo, Mingrui Zhu, Nannan Wang, Xinbo Gao, Jie Hu
In this work, we design an effective diffusion transformer for image super-resolution (DiT-SR) that achieves the visual quality of prior-based methods, but through a training-from-scratch manner.
no code implementations • 28 Sep 2024 • Jiaxu Leng, Zhanjie Wu, Mingpi Tan, Yiran Liu, Ji Gan, Haosheng Chen, Xinbo Gao
While numerous Video Violence Detection (VVD) methods have focused on representation learning in Euclidean space, they struggle to learn sufficiently discriminative features, leading to weaknesses in recognizing normal events that are visually similar to violent events (\emph{i. e.}, ambiguous violence).
no code implementations • 14 Sep 2024 • Guang Yang, Jie Li, Xin Liu, Zhusi Zhong, Xinbo Gao
Existing methods take pixel intensity, texture and high-level vision task information as the standards to determine preservation of information, lacking enhancement for human perception.
no code implementations • 13 Sep 2024 • Hangyu Li, Yihan Xu, Jiangchao Yao, Nannan Wang, Xinbo Gao, Bo Han
Then, we transform the facial expression representation to a neutral representation by simulating the difference in text embeddings from textual facial expression to textual neutral.
Facial Expression Recognition
Facial Expression Recognition (FER)
1 code implementation • 7 Sep 2024 • Mingjin Zhang, Chi Zhang, Qiming Zhang, Yunsong Li, Xinbo Gao, Jing Zhang
Recent advancements in deep learning have greatly advanced the field of infrared small object detection (IRSTD).
no code implementations • 4 Sep 2024 • Yilong Chen, Zongyi Xu, Xiaoshui Huang, Shanshan Zhao, Xinqi Jiang, Xinyu Gao, Xinbo Gao
Compared to single-modal knowledge distillation, cross-modal knowledge distillation faces more severe challenges due to domain gaps between modalities.
no code implementations • 26 Aug 2024 • Chaohua Shi, Xuan Wang, Si Shi, Xule Wang, Mingrui Zhu, Nannan Wang, Xinbo Gao
However, existing diffusion models face challenges in processing and fusing information from multiple images and lack access to high-quality publicly available datasets, which prevents the application of diffusion models in food image composition.
no code implementations • 20 Aug 2024 • Huafeng Qin, Yuming Fu, Huiyan Zhang, Mounim A. El-Yacoubi, Xinbo Gao, Qun Song, Jun Wang
At the testing stage, given an adversarial sample, the MsMemoryGAN retrieves its most relevant normal patterns in memory for the reconstruction.
1 code implementation • 14 Aug 2024 • Xiao He, Huaao Tang, Zhijun Tu, Junchao Zhang, Kun Cheng, Hanting Chen, Yong Guo, Mingrui Zhu, Nannan Wang, Xinbo Gao, Jie Hu
Specifically, we introduce a novel score distillation strategy to align the data distribution between the outputs of the student and teacher models after minor noise perturbation.
no code implementations • 11 Aug 2024 • Huafeng Qin, Yuming Fu, Jing Chen, Mounim A. El-Yacoubi, Xinbo Gao, Feng Xi
In this paper, first, we propose a hybrid network structure named Global-local Vision Mamba (GLVM), to learn the local correlations in images explicitly and global dependencies among tokens for vein feature representation.
2 code implementations • 31 Jul 2024 • Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris McConnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, Danny Wyatt, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-Olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab AlBadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Francisco Guzmán, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Govind Thattai, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jack Zhang, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Karthik Prasad, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-Arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Kushal Lakhotia, Lauren Rantala-Yeary, Laurens van der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke de Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Maria Tsimpoukelli, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Ning Zhang, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohan Maheswari, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vítor Albiero, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaofang Wang, Xiaoqing Ellen Tan, Xide Xia, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aayushi Srivastava, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Amos Teo, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Dong, Annie Franco, Anuj Goyal, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Ce Liu, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-Hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Cynthia Gao, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Eric-Tuan Le, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Filippos Kokkinos, Firat Ozgenel, Francesco Caggioni, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Grant Herman, Grigory Sizov, Guangyi, Zhang, Guna Lakshminarayanan, Hakan Inan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Hongyuan Zhan, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Ilias Leontiadis, Irina-Elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Janice Lam, Japhet Asher, Jean-Baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan McPhie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kiran Jagadeesh, Kun Huang, Kunal Chawla, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Miao Liu, Michael L. Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikhil Mehta, Nikolay Pavlovich Laptev, Ning Dong, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Rangaprabhu Parthasarathy, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Russ Howes, Ruty Rinott, Sachin Mehta, Sachin Siby, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Mahajan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shishir Patil, Shiva Shankar, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Summer Deng, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Koehler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, WenWen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaojian Wu, Xiaolan Wang, Xilun Wu, Xinbo Gao, Yaniv Kleinman, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu, Wang, Yu Zhao, Yuchen Hao, Yundi Qian, Yunlu Li, Yuzi He, Zach Rait, Zachary DeVito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao, Zhiyu Ma
This paper presents a new set of foundation models, called Llama 3.
Ranked #3 on
Question Answering
on PeerQA
1 code implementation • 19 Jul 2024 • Decheng Liu, Zongqi Wang, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao
In the paper, we first contribute a dedicated dataset called the Fair Forgery Detection (FairFD) dataset, where we prove the racial bias of public state-of-the-art (SOTA) methods.
1 code implementation • 10 Jul 2024 • Mingjin Zhang, YuChun Wang, Jie Guo, Yunsong Li, Xinbo Gao, Jing Zhang
The recent Segment Anything Model (SAM) is a significant advancement in natural image segmentation, exhibiting potent zero-shot performance suitable for various downstream image segmentation tasks.
2 code implementations • 10 Jul 2024 • Huafeng Qin, Xin Jin, Hongyu Zhu, Hongchao Liao, Mounîm A. El-Yacoubi, Xinbo Gao
Mixup data augmentation approaches have been applied for various tasks of deep learning to improve the generalization ability of deep neural networks.
no code implementations • 17 Jun 2024 • Decheng Liu, Zhan Dang, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao
In addition, a personalized federated learning training strategy is utilized to update the parameters of the distributed detection model.
1 code implementation • 16 Jun 2024 • Decheng Liu, Qixuan Su, Chunlei Peng, Nannan Wang, Xinbo Gao
With the great development of generative model techniques, face forgery detection draws more and more attention in the related field.
1 code implementation • 16 Jun 2024 • Decheng Liu, Tao Chen, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao
The robust feature of intra-class samples can maintain appropriate diversity; 2) \textbf{Discriminability}.
no code implementations • 13 Jun 2024 • Shuang Li, Jiaxu Leng, Guozhang Li, Ji Gan, Haosheng Chen, Xinbo Gao
Specifically, IFP is designed to extract fine-grained semantic features unrelated to clothes from the raw image, guided by the cloth-agnostic text prompts.
no code implementations • 3 Jun 2024 • Zhusi Zhong, Helen Zhang, Fayez H. Fayad, Andrew C. Lancaster, John Sollee, Shreyas Kulkarni, Cheng Ting Lin, Jie Li, Xinbo Gao, Scott Collins, Colin Greineder, Sun H. Ahn, Harrison X. Bai, Zhicheng Jiao, Michael K. Atalay
Imaging features and/or clinical variables were then incorporated into DL models to predict survival outcomes.
no code implementations • 2 Jun 2024 • Haojun Xu, Yan Gao, Jie Li, Xinbo Gao
Significant action recognition performance is achieved when evaluated on the challenging NTU RGB+D, NTU RGB+D 120, and PKU-MMD benchmarks and validate that multi-granularity semantic features facilitate the differentiation of action clusters with similar visual features.
no code implementations • 29 May 2024 • Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao
MindSemantix generates high-quality captions that are deeply rooted in the visual and semantic information derived from brain activity.
1 code implementation • 23 May 2024 • Zhusi Zhong, Jie Li, John Sollee, Scott Collins, Harrison Bai, Paul Zhang, Terrence Healey, Michael Atalay, Xinbo Gao, Zhicheng Jiao
In response to the worldwide COVID-19 pandemic, advanced automated technologies have emerged as valuable tools to aid healthcare professionals in managing an increased workload by improving radiology report generation and prognostic analysis.
1 code implementation • 5 May 2024 • Zhusi Zhong, Jie Li, Zhuoqi Ma, Scott Collins, Harrison Bai, Paul Zhang, Terrance Healey, Xinbo Gao, Michael K. Atalay, Zhicheng Jiao
The COVID-19 pandemic has strained global public health, necessitating accurate diagnosis and intervention to control disease spread and reduce mortality rates.
no code implementations • 19 Apr 2024 • Yilong Chen, Zongyi Xu, Xiaoshui Huang, Ruicheng Zhang, Xinqi Jiang, Xinbo Gao
Specifically, we propose employing scatter images to annotate LiDAR point clouds, combining a pre-trained optical flow estimation network with a foundation image segmentation model to rapidly propagate manual annotations into dense labels for both images and point clouds.
no code implementations • 4 Apr 2024 • Lei Zhang, YuHang Zhou, Yi Yang, Xinbo Gao
Despite providing high-performance solutions for computer vision tasks, the deep neural network (DNN) model has been proved to be extremely vulnerable to adversarial attacks.
no code implementations • 27 Mar 2024 • Ruoyu Zhao, Qingnan Fan, Fei Kou, Shuai Qin, Hong Gu, Wei Wu, Pengcheng Xu, Mingrui Zhu, Nannan Wang, Xinbo Gao
Two key techniques are introduced into InstructBrush, Attention-based Instruction Optimization and Transformation-oriented Instruction Initialization, to address the limitations of the previous method in terms of inversion effects and instruction generalization.
1 code implementation • 22 Mar 2024 • Lei Zhang, Xiaowei Fu, Fuxiang Huang, Yi Yang, Xinbo Gao
Person re-identification (ReID) has made great strides thanks to the data-driven deep learning techniques.
1 code implementation • 13 Mar 2024 • Zhangxuan Dang, Yu Zheng, Xinglin Lin, Chunlei Peng, Qiuyu Chen, Xinbo Gao
We consider the problem of anomaly network traffic detection and propose a three-stage anomaly detection framework using only normal traffic.
no code implementations • 5 Mar 2024 • Chenqiang Gao, Chuandong Liu, Jun Shu, Fangcen Liu, Jiang Liu, Luyu Yang, Xinbo Gao, Deyu Meng
Current state-of-the-art (SOTA) 3D object detection methods often require a large amount of 3D bounding box annotations for training.
2 code implementations • 1 Mar 2024 • Junjie Guo, Chenqiang Gao, Fangcen Liu, Deyu Meng, Xinbo Gao
To effectively mine the complementary information and adapt to misalignment situations, we propose a Multispectral Deformable Cross-attention module to adaptively sample and aggregate multi-semantic level features of infrared and visible images for each object.
1 code implementation • 22 Feb 2024 • Zhaoyang Wang, Bo Hu, Mingyang Zhang, Jie Li, Leida Li, Maoguo Gong, Xinbo Gao
Firstly, we devise a new diffusion restoration network that leverages the produced enhanced image and noise-containing images, incorporating nonlinear features obtained during the denoising process of the diffusion model, as high-level visual information.
1 code implementation • 4 Feb 2024 • Zhenxing Niu, Haodong Ren, Xinbo Gao, Gang Hua, Rong Jin
This paper focuses on jailbreaking attacks against multi-modal large language models (MLLMs), seeking to elicit MLLMs to generate objectionable responses to harmful user queries.
1 code implementation • 1 Feb 2024 • Lingfeng He, De Cheng, Nannan Wang, Xinbo Gao
While prior work focuses on establishing cross-modality pseudo-label associations to bridge the modality-gap, they ignore maintaining the instance-level homogeneous and heterogeneous consistency between the feature space and the pseudo-label space, resulting in coarse associations.
no code implementations • 29 Jan 2024 • Shiyin Dong, Mingrui Zhu, Kun Cheng, Nannan Wang, Xinbo Gao
Our purpose is to establish a unified visual perception framework, capitalizing on the potential synergies between generative and discriminative models.
1 code implementation • 26 Jan 2024 • Nuoyan Zhou, Dawei Zhou, Decheng Liu, Nannan Wang, Xinbo Gao
We introduce a feature disentangler to separate out the specific latent features from the features of the adversarial samples, thereby boosting robustness by eliminating the specific latent features.
1 code implementation • 11 Jan 2024 • Chunlei Peng, Boyu Wang, Decheng Liu, Nannan Wang, Ruimin Hu, Xinbo Gao
To address this, we mask the clothing and color information in the personal attribute description extracted through an attribute detection model.
no code implementations • 10 Jan 2024 • Huafeng Qin, Hongyu Zhu, Xin Jin, Qun Song, Mounim A. El-Yacoubi, Xinbo Gao
To this end, we propose a mixed block consisting of three modules, transformer, attention Long short-term memory (attention LSTM), and Fourier transformer.
no code implementations • CVPR 2024 • De Cheng, Zhipeng Xu, Xinyang Jiang, Nannan Wang, Dongsheng Li, Xinbo Gao
Although there is a growing focus on VFM-based domain prompt tuning for DG effectively learning prompts that disentangle invariant features across all domains remains a major challenge.
no code implementations • 20 Dec 2023 • Xingyilang Yin, Xi Yang, Liangchen Liu, Nannan Wang, Xinbo Gao
Additional offsets and modulation scalars are learned on the whole point features, which shift the deformable reference points to the regions of interest.
2 code implementations • 19 Dec 2023 • Huafeng Qin, Xin Jin, Yun Jiang, Mounim A. El-Yacoubi, Xinbo Gao
In this paper, we propose AdAutomixup, an adversarial automatic mixup augmentation approach that generates challenging samples to train a robust classifier for image classification, by alternatively optimizing the classifier and the mixup sample generator.
1 code implementation • 18 Dec 2023 • Decheng Liu, Xijun Wang, Chunlei Peng, Nannan Wang, Ruiming Hu, Xinbo Gao
Adversarial attacks involve adding perturbations to the source image to cause misclassification by the target model, which demonstrates the potential of attacking face recognition models.
1 code implementation • 17 Dec 2023 • Guang Yang, Jie Li, Xinbo Gao
Specifically, we introduce a Spatial-Frequency Fusion Block to facilitate efficient interaction between dual domains and capture complementary information from input images with different exposures.
1 code implementation • 16 Dec 2023 • Decheng Liu, Xu Luo, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao
In this paper, we propose a novel Symmetrical Bidirectional Knowledge Alignment for zero-shot sketch-based image retrieval (SBKA).
no code implementations • 14 Dec 2023 • Yan Gao, Haojun Xu, Nannan Wang, Jie Li, Xinbo Gao
In addition to the previous method of treating objects as nodes, the network innovatively treats object trajectories as nodes for information interaction, improving the graph neural network's feature representation capability.
2 code implementations • 7 Dec 2023 • Chunlei Peng, Huiqing Guo, Decheng Liu, Nannan Wang, Ruimin Hu, Xinbo Gao
Considering the complexity of the quality distribution of both real and fake faces, we propose a novel Deepfake detection framework named DeepFidelity to adaptively distinguish real and fake faces with varying image quality by mining the perceptual forgery fidelity of face images.
1 code implementation • 7 Dec 2023 • Guang Yang, Jie Li, Hanxiao Lei, Xinbo Gao
In this study, we propose a multi-scale dual attention (MDA) framework for infrared and visible image fusion, which is designed to measure and integrate complementary information in both structure and loss function at the image and patch level.
no code implementations • 5 Dec 2023 • Guozhang Li, Xinpeng Ding, De Cheng, Jie Li, Nannan Wang, Xinbo Gao
To further clarify the noise of expanded boundaries, we combine mutual learning with a tailored proposal-level contrastive objective to use a learnable approach to harmonize a balance between incomplete yet clean (initial) and comprehensive yet noisy (expanded) boundaries for more precise ones.
no code implementations • 24 Nov 2023 • Ruoyu Zhao, Mingrui Zhu, Shiyin Dong, Nannan Wang, Xinbo Gao
We propose CatVersion, an inversion-based method that learns the personalized concept through a handful of examples.
no code implementations • 15 Nov 2023 • Dongxin Chen, Mingrui Zhu, Nannan Wang, Xinbo Gao
To disentangle the latent codes in the GAN inversion space, we introduce an Identity Disentanglement Module (IDM).
no code implementations • 13 Nov 2023 • Qinlin He, Chunlei Peng, Decheng Liu, Nannan Wang, Xinbo Gao
DeepFake detection is pivotal in personal privacy and public safety.
no code implementations • 27 Oct 2023 • Shuang Li, Jiaxu Leng, Ji Gan, Mengjingcheng Mo, Xinbo Gao
One pertains to the dependence on auxiliary models for shape feature extraction in the inference phase, along with the errors in generated infrared shapes due to the intrinsic modality disparity.
no code implementations • 24 Oct 2023 • Feng Gao, Jiaxu Leng, Ji Gan, Xinbo Gao
Moreover, to train the rank prediction head better, we propose Soft Gradient L1 Loss.
1 code implementation • 5 Oct 2023 • Nuoyan Zhou, Nannan Wang, Decheng Liu, Dawei Zhou, Xinbo Gao
Deep neural networks are vulnerable to adversarial noise.
Ranked #1 on
Adversarial Defense
on CIFAR-100
no code implementations • 14 Sep 2023 • Liangchen Liu, Nannan Wang, Dawei Zhou, Xinbo Gao, Decheng Liu, Xi Yang, Tongliang Liu
This paper targets a novel trade-off problem in generalizable prompt learning for vision-language models (VLM), i. e., improving the performance on unseen classes while maintaining the performance on seen classes.
no code implementations • 11 Sep 2023 • Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao
In this paper, we unify the task of anonymization and visual identity information hiding and propose a novel face privacy protection method based on diffusion models, dubbed Diff-Privacy.
1 code implementation • ICCV 2023 • Zongyi Xu, Bo Yuan, Shanshan Zhao, Qianni Zhang, Xinbo Gao
The most recent methods of this kind measure the uncertainty of each pre-divided region for manual labelling but they suffer from redundant information and require additional efforts for region division.
1 code implementation • ICCV 2023 • Mingjin Zhang, Chi Zhang, Qiming Zhang, Jie Guo, Xinbo Gao, Jing Zhang
Single hyperspectral image super-resolution (single-HSI-SR) aims to restore a high-resolution hyperspectral image from a low-resolution observation.
1 code implementation • 21 Jul 2023 • Decheng Liu, Tao Chen, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao
Due to the successful development of deep image generation technology, visual data forgery detection would play a more important role in social and economic security.
no code implementations • 18 Jul 2023 • Lin Yuan, Kai Liang, Xiao Pu, Yan Zhang, Jiaxu Leng, Tao Wu, Nannan Wang, Xinbo Gao
This paper proposes a novel paradigm for facial privacy protection that unifies multiple characteristics including anonymity, diversity, reversibility and security within a single lightweight framework.
no code implementations • 6 Jul 2023 • Ruiyang Xia, Decheng Liu, Jie Li, Lin Yuan, Nannan Wang, Xinbo Gao
Advanced manipulation techniques have provided criminals with opportunities to make social panic or gain illicit profits through the generation of deceptive media, such as forged face images.
no code implementations • 14 Jun 2023 • Zhusi Zhong, Jie Li, Lulu Bi, Li Yang, Ihab Kamel, Rama Chellappa, Xinbo Gao, Harrison Bai, Zhicheng Jiao
Medical image segmentation based on deep learning often fails when deployed on images from a different domain.
no code implementations • 23 May 2023 • Mingjin Zhang, Jiamin Xu, Chengyu He, Wenteng Shang, Yunsong Li, Xinbo Gao
Synthetic aperture radar (SAR) is prevalent in the remote sensing field but is difficult to interpret in human visual perception.
no code implementations • 22 May 2023 • De Cheng, Lingfeng He, Nannan Wang, Shizhou Zhang, Zhen Wang, Xinbo Gao
To this end, we propose a novel bilateral cluster matching-based learning framework to reduce the modality gap by matching cross-modality clusters.
no code implementations • 22 May 2023 • De Cheng, Xiaojian Huang, Nannan Wang, Lingfeng He, Zhihui Li, Xinbo Gao
Unsupervised learning visible-infrared person re-identification (USL-VI-ReID) aims at learning modality-invariant features from unlabeled cross-modality dataset, which is crucial for practical applications in video surveillance systems.
1 code implementation • 21 May 2023 • Haojun Xu, Yan Gao, Zheng Hui, Jie Li, Xinbo Gao
Also, humans have brain regions dedicated to understanding the minds of others and analyzing their intentions, such as the medial prefrontal cortex of the temporal lobe.
Ranked #3 on
Skeleton Based Action Recognition
on NTU RGB+D 120
(using extra training data)
no code implementations • 18 May 2023 • Feng Gao, Jiaxu Leng, Gan Ji, Xinbo Gao
However, in crowded pedestrian detection, the performance of DETRs is still unsatisfactory due to the inappropriate sample selection method which results in more false positives.
no code implementations • 12 May 2023 • Quanxue Gao, Qianqian Wang, Han Lu, Wei Xia, Xinbo Gao
Although numerous clustering algorithms have been developed, many existing methods still leverage k-means technique to detect clusters of data points.
no code implementations • 9 May 2023 • Shiyin Dong, Mingrui Zhu, Nannan Wang, Xinbo Gao
Zero-shot sketch-based image retrieval (ZS-SBIR) is challenging due to the cross-domain nature of sketches and photos, as well as the semantic gap between seen and unseen image distributions.
1 code implementation • CVPR 2023 • Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Xiaoyu Wang, Xinbo Gao
For the discriminative objective, we propose a Text-Segment Mining (TSM) mechanism, which constructs a text description based on the action class label, and regards the text as the query to mine all class-related segments.
1 code implementation • 25 Apr 2023 • Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Jie Li, Xinbo Gao
The proposed Bi-SCC firstly adopts a temporal context augmentation to generate an augmented video that breaks the correlation between positive actions and their co-scene actions in the inter-video; Then, a semantic consistency constraint (SCC) is used to enforce the predictions of the original video and augmented video to be consistent, hence suppressing the co-scene actions.
Weakly-supervised Temporal Action Localization
Weakly Supervised Temporal Action Localization
no code implementations • 22 Apr 2023 • Lin Qi, Xuewen Qin, Feng Gao, Junyu Dong, Xinbo Gao
To this end, we put forward a spatial attention weighted unmixing network, dubbed as SAWU-Net, which learns a spatial attention network and a weighted unmixing network in an end-to-end manner for better spatial feature exploitation.
no code implementations • 21 Apr 2023 • Shuyin Xia, Guoyin Wang, Xinbo Gao, Xiaoyu Lian
This mechanism inherently possesses an adaptive multi-granularity description capacity, resulting in computational traits such as efficiency, robustness, and interpretability.
1 code implementation • CVPR 2023 • Chuandong Liu, Chenqiang Gao, Fangcen Liu, Pengcheng Li, Deyu Meng, Xinbo Gao
State-of-the-art 3D object detectors are usually trained on large-scale datasets with high-quality 3D annotations.
no code implementations • 29 Mar 2023 • Jing Li, Quanxue Gao, Qianqian Wang, Wei Xia, Xinbo Gao
Multi-view clustering (MVC) based on non-negative matrix factorization (NMF) and its variants have received a huge amount of attention in recent years due to their advantages in clustering interpretability.
no code implementations • 7 Mar 2023 • Jiang Xie, Qiao Deng, Shuyin Xia, Yangzhou Zhao, Guoyin Wang, Xinbo Gao
In recent years, the problem of fuzzy clustering has been widely concerned.
no code implementations • 2 Mar 2023 • Jiang Xie, Shuyin Xia, Guoyin Wang, Xinbo Gao
We construct coarsegrained granular-balls, and then use granular-balls and MST to implement the clustering method based on "large-scale priority", which can greatly avoid the influence of outliers and accelerate the construction process of MST.
no code implementations • 24 Jan 2023 • Xiao He, Mingrui Zhu, Nannan Wang, Xinbo Gao, Heng Yang
To address this issue, we propose a novel font generation approach by learning the Difference between different styles and the Similarity of the same style (DS-Font).
1 code implementation • CVPR 2023 • Bin Xiao, Yang Hu, Bo Liu, Xiuli Bi, Weisheng Li, Xinbo Gao
Since their binarization processes are not a component of the network, the learning-based binary descriptor cannot fully utilize the advances of deep learning.
1 code implementation • CVPR 2023 • Yongchao Wang, Bin Xiao, Xiuli Bi, Weisheng Li, Xinbo Gao
Inspired by the plain contrast idea, MCF introduces two different subnets to explore and utilize the discrepancies between subnets to correct cognitive bias of the model.
1 code implementation • 30 Dec 2022 • Decheng Liu, Zeyang Zheng, Chunlei Peng, Yukai Wang, Nannan Wang, Xinbo Gao
Face forgery detection plays an important role in personal privacy and social security.
no code implementations • International Journal of Computer Vision 2022 • Zhenwei He, Lei Zhang, Xinbo Gao, David Zhang
Our proposed MAF has two distinct contributions: (1) The Hierarchical Domain Feature Alignment (HDFA) module is introduced to minimize the image-level domain disparity, where Scale Reduction Module (SRM) reduces the feature map size without information loss and increases the training efficiency.
1 code implementation • ICCV 2023 • Mingrui Zhu, Xiao He, Nannan Wang, Xiaoyu Wang, Xinbo Gao
In this paper, we propose a novel all-to-key attention mechanism -- each position of content features is matched to stable key positions of style features -- that is more in line with the characteristics of style transfer.
no code implementations • 4 Dec 2022 • Qihuang Zhong, Liang Ding, Yibing Zhan, Yu Qiao, Yonggang Wen, Li Shen, Juhua Liu, Baosheng Yu, Bo Du, Yixin Chen, Xinbo Gao, Chunyan Miao, Xiaoou Tang, DaCheng Tao
This technical report briefly describes our JDExplore d-team's Vega v2 submission on the SuperGLUE leaderboard.
Ranked #1 on
Common Sense Reasoning
on ReCoRD
no code implementations • 30 Nov 2022 • De Cheng, Haichun Tai, Nannan Wang, Zhen Wang, Xinbo Gao
In this paper, we propose a Neighbour Consistency guided Pseudo Label Refinement (NCPLR) framework, which can be regarded as a transductive form of label propagation under the assumption that the prediction of each example should be similar to its nearest neighbours'.
1 code implementation • NIPS 2022 • De Cheng, Yixiong Ning, Nannan Wang, Xinbo Gao, Heng Yang, Yuxuan Du, Bo Han, Tongliang Liu
We show that the cycle-consistency regularization helps to minimize the volume of the transition matrix T indirectly without exploiting the estimated noisy class posterior, which could further encourage the estimated transition matrix T to converge to its optimal solution.
no code implementations • 30 Oct 2022 • Yu Zheng, Zhangxuan Dang, Chunlei Peng, Chao Yang, Xinbo Gao
In this paper, we propose an MLP-Mixer based multi-view multi-label neural network for network traffic classification.
3 code implementations • 28 Oct 2022 • Yan Zhang, Xiyuan Gao, Qingyan Duan, Jiaxu Leng, Xiao Pu, Xinbo Gao
By stacking various layers of CSA blocks, we propose the Fourier Complex Transformer (FCT) model to learn global contextual information from VHR aerial images following the hierarchical manners.
no code implementations • 21 Oct 2022 • Shuyin Xia, Xiaoyu Lian, Guoyin Wang, Xinbo Gao, Yabin Shao
Most existing fuzzy set methods use points as their input, which is the finest granularity from the perspective of granular computing.
1 code implementation • 18 Oct 2022 • Decheng Liu, Zhan Dang, Chunlei Peng, Yu Zheng, Shuang Li, Nannan Wang, Xinbo Gao
Experiments conducted on publicly available face forgery detection datasets prove the superior performance of the proposed FedForgery.
1 code implementation • 6 Oct 2022 • Shuyin Xia, Xiaoyu Lian, Guoyin Wang, Xinbo Gao, Jiancu Chen, Xiaoli Peng
Furthermore, a particle swarm optimization algorithm is designed to solve the dual model.
1 code implementation • ICCV 2023 • Zhigang Su, Dawei Zhou, Nannan Wangu, Decheng Li, Zhen Wang, Xinbo Gao
Growing leakage and misuse of visual information raise security and privacy concerns, which promotes the development of information protection.
1 code implementation • 5 Sep 2022 • Pinjun Luo, GuoQiang Xiao, Xinbo Gao, Song Wu
The designed DLKCB can split the deep-wise large kernel convolution into a smaller depth-wise convolution and a depth-wise dilated convolution without introducing massive parameters and computational overhead.
no code implementations • 25 Jul 2022 • Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao
In psychology, the \textit{Object-Appraisal-Emotion} model has demonstrated that each individual's emotion is affected by his/her subjective appraisal, which is further formed by the affective memory.
1 code implementation • 25 Jul 2022 • Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Xiaoyu Wang, Yibing Zhan, Tongliang Liu
To alleviate this negative effect, in this paper, we investigate the dependence between outputs of the target model and input adversarial samples from the perspective of information theory, and propose an adversarial defense method.
1 code implementation • 12 Jul 2022 • Decheng Liu, Weijie He, Chunlei Peng, Nannan Wang, Jie Li, Xinbo Gao
The multiple branches transformer is employed to explore the inter-correlation between different attributes in similar semantic regions for attribute feature learning.
no code implementations • 5 Jul 2022 • Yukai Wang, Chunlei Peng, Decheng Liu, Nannan Wang, Xinbo Gao
In recent years, with the rapid development of face editing and generation, more and more fake videos are circulating on social media, which has caused extreme public concerns.
no code implementations • CVPR 2022 • De Cheng, Tongliang Liu, Yixiong Ning, Nannan Wang, Bo Han, Gang Niu, Xinbo Gao, Masashi Sugiyama
In label-noise learning, estimating the transition matrix has attracted more and more attention as the matrix plays an important role in building statistically consistent classifiers.
1 code implementation • 30 May 2022 • Jiachen Yang, Zhuo Zhang, Yicheng Gong, Shukun Ma, Xiaolan Guo, Yue Yang, Shuai Xiao, Jiabao Wen, Yang Li, Xinbo Gao, Wen Lu, Qinggang Meng
Data has now become a shortcoming of deep learning.
1 code implementation • 19 Apr 2022 • Yue Zhao, Lingming Zhang, Yang Liu, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen
The state-of-the-art deep learning-based methods often simply concatenate the raw geometric attributes (i. e., coordinates and normal vectors) of mesh cells to train a single-stream network for automatic intra-oral scanner image segmentation.
no code implementations • 29 Mar 2022 • De Cheng, Yan Li, Dingwen Zhang, Nannan Wang, Xinbo Gao, Jiande Sun
To properly address this problem, we propose a novel density-variational learning framework to improve the robustness of the image dehzing model assisted by a variety of negative hazy images, to better deal with various complex hazy scenarios.
1 code implementation • CVPR 2022 • Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao
In this paper, we learn an Adaptive Confidence Margin (Ada-CM) to fully leverage all unlabeled data for semi-supervised deep facial expression recognition.
Facial Expression Recognition
Facial Expression Recognition (FER)
no code implementations • 12 Mar 2022 • Lin Qi, Feng Gao, Junyu Dong, Xinbo Gao, Qian Du
Important findings on the use of spatial and spectral information in the autoencoder framework are discussed.
1 code implementation • 4 Mar 2022 • Mingrui Zhu, Yun Yi, Nannan Wang, Xiaoyu Wang, Xinbo Gao
The large discrepancy between the source non-makeup image and the reference makeup image is one of the key challenges in makeup transfer.
no code implementations • 12 Jan 2022 • Shuyin Xia, Xiaochuan Dai, Guoyin Wang, Xinbo Gao, Elisabeth Giem
In addition, this paper first provides the mathematical models for the granular-ball covering.
no code implementations • 10 Jan 2022 • Shuyin Xia, Cheng Wang, Guoyin Wang, Weiping Ding, Xinbo Gao, JianHang Yu, Yujia Zhai, Zizhong Chen
The granular-ball rough set can simultaneously represent Pawlak rough sets, and the neighborhood rough set, so as to realize the unified representation of the two.
no code implementations • CVPR 2022 • Chuandong Liu, Chenqiang Gao, Fangcen Liu, Jiang Liu, Deyu Meng, Xinbo Gao
In the meantime, we design a reliable background mining module and a point cloud filling data augmentation strategy to generate the confident data for iteratively learning with reliable supervision.
no code implementations • 29 Dec 2021 • Shuyin Xia, Xinyu Bai, Guoyin Wang, Deyu Meng, Xinbo Gao, Zizhong Chen, Elisabeth Giem
This paper present a strong data mining method based on rough set, which can realize feature selection, classification and knowledge representation at the same time.
1 code implementation • 16 Nov 2021 • Yuanfei Huang, Jie Li, Yanting Hu, Xinbo Gao, Hua Huang
Recently, deep-learning-based super-resolution methods have achieved excellent performances, but mainly focus on training a single generalized deep network by feeding numerous samples.
no code implementations • 25 Oct 2021 • Haosheng Chen, Shuyuan Lin, Yan Yan, Hanzi Wang, Xinbo Gao
In EDA, we first asynchronously fuse the event data based on its information entropy.
no code implementations • 24 Oct 2021 • Jingyuan Yang, Xinbo Gao, Leida Li, Xiumei Wang, Jinshan Ding
Inspired by this, we propose a novel Scene-Object interreLated Visual Emotion Reasoning network (SOLVER) to predict emotions from images.
no code implementations • 15 Oct 2021 • Wei Xia, Quanxue Gao, Ming Yang, Xinbo Gao
Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels.
no code implementations • 29 Sep 2021 • Fangcen Liu, Chenqiang Gao, Fang Chen, Deyu Meng, WangMeng Zuo, Xinbo Gao
We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range.
no code implementations • 29 Sep 2021 • De Cheng, Jingyu Zhou, Nannan Wang, Xinbo Gao
However, since person Re-Id is an open-set problem, the clustering based methods often leave out lots of outlier instances or group the instances into the wrong clusters, thus they can not make full use of the training samples as a whole.
1 code implementation • 22 Sep 2021 • Yan Li, De Cheng, Jiande Sun, Dingwen Zhang, Nannan Wang, Xinbo Gao
In this paper, we propose a single image dehazing method with an independent Detail Recovery Network (DRN), which considers capturing the details from the input image over a separate network and then integrates them into a coarse dehazed image.
no code implementations • 4 Sep 2021 • Jingyuan Yang, Jie Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao
Then, we design three specific networks, i. e., Global-Net, Semantic-Net and Expression-Net, to extract distinct emotional features from different stimuli simultaneously.
no code implementations • ICCV 2021 • Xinpeng Ding, Nannan Wang, Shiwei Zhang, De Cheng, Xiaomeng Li, Ziyuan Huang, Mingqian Tang, Xinbo Gao
The contrastive objective aims to learn effective representations by contrastive learning, while the caption objective can train a powerful video encoder supervised by texts.
no code implementations • 15 Aug 2021 • Quanxue Gao, Wei Xia, Xinbo Gao, Xiangdong Zhang, Qin Li, DaCheng Tao
Despite the impressive clustering performance and efficiency in characterizing both the relationship between data and cluster structure, existing graph-based multi-view clustering methods still have the following drawbacks.
no code implementations • 29 Jun 2021 • Tianyu Jiang, Quanxue Gao, Xinbo Gao
Specifically, we construct a hidden and tractable large graph by anchor graph for each view and well exploit complementary information embedded in anchor graphs of different views by tensor Schatten p-norm regularizer.
no code implementations • CVPR 2021 • Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Xinbo Gao
Visual Emotion Analysis (VEA) has attracted increasing attention recently with the prevalence of sharing images on social networks.
no code implementations • CVPR 2021 • Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao
Instead of considering iterative strategy, we make the blur kernel predictor trainable in the whole blind SR model, in which AMNet is well-trained.
no code implementations • CVPR 2021 • Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen
State-of-the-art methods directly concatenate the raw attributes of 3D inputs, namely coordinates and normal vectors of mesh cells, to train a single-stream network for fully-automated tooth segmentation.
no code implementations • 10 Jun 2021 • Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Jun Yu, Xiaoyu Wang, Tongliang Liu
However, pre-processing methods may suffer from the robustness degradation effect, in which the defense reduces rather than improving the adversarial robustness of a target model in a white-box setting.
no code implementations • 9 Jun 2021 • Dawei Zhou, Tongliang Liu, Bo Han, Nannan Wang, Chunlei Peng, Xinbo Gao
However, given the continuously evolving attacks, models trained on seen types of adversarial examples generally cannot generalize well to unseen types of adversarial examples.
no code implementations • 17 May 2021 • Andrey Ignatov, Andres Romero, Heewon Kim, Radu Timofte, Chiu Man Ho, Zibo Meng, Kyoung Mu Lee, Yuxiang Chen, Yutong Wang, Zeyu Long, Chenhao Wang, Yifei Chen, Boshen Xu, Shuhang Gu, Lixin Duan, Wen Li, Wang Bofei, Zhang Diankai, Zheng Chengjian, Liu Shaoli, Gao Si, Zhang Xiaofeng, Lu Kaidi, Xu Tianyu, Zheng Hui, Xinbo Gao, Xiumei Wang, Jiaming Guo, Xueyi Zhou, Hao Jia, Youliang Yan
Video super-resolution has recently become one of the most important mobile-related problems due to the rise of video communication and streaming services.
no code implementations • ICCV 2021 • Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu
Then, we train a denoising model to minimize the distances between the adversarial examples and the natural examples in the class activation feature space.
2 code implementations • CVPR 2021 • Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao
Inspired by the common painting process of drawing a draft and revising the details, we introduce a novel feed-forward method named Laplacian Pyramid Network (LapStyle).
1 code implementation • 29 Mar 2021 • Yuanfei Huang, Jie Li, Yanting Hu, Xinbo Gao, Hua Huang
Being extremely dependent on iterative estimation of the degradation prior or optimization of the model from scratch, the existing blind super-resolution (SR) methods are generally time-consuming and less effective, as the estimation of degradation proceeds from a blind initialization and lacks interpretable degradation priors.
no code implementations • ICCV 2021 • Ziyu Wei, Xi Yang, Nannan Wang, Xinbo Gao
Visible infrared person re-identification (VI-REID) aims to match pedestrian images between the daytime visible and nighttime infrared camera views.
no code implementations • 1 Jan 2021 • Dawei Zhou, Tongliang Liu, Bo Han, Nannan Wang, Xinbo Gao
Motivated by this observation, we propose a defense framework ADD-Defense, which extracts the invariant information called \textit{perturbation-invariant representation} (PIR) to defend against widespread adversarial examples.
1 code implementation • 26 Dec 2020 • Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen
State-of-the-art methods directly concatenate the raw attributes of 3D inputs, namely coordinates and normal vectors of mesh cells, to train a single-stream network for fully-automated tooth segmentation.
no code implementations • 3 Dec 2020 • Bo Liu, Ranglei Wu, Xiuli Bi, Bin Xiao, Weisheng Li, Guoyin Wang, Xinbo Gao
The unfixed encoder autonomously learns the image fingerprints that differentiate between the tampered and non-tampered regions, whereas the fixed encoder intentionally provides the direction information that assists the learning and detection of the network.
no code implementations • 31 Oct 2020 • Shuyin Xia, Wenhua Li, Guoyin Wang, Xinbo Gao, Changqing Zhang, Elisabeth Giem
Based on the theorem, we propose the LRA framework for accelerating rough set algorithms.
1 code implementation • 28 Sep 2020 • Yuanfei Huang, Jie Li, Xinbo Gao, Yanting Hu, Wen Lu
To solve them, we propose a purposeful and interpretable detail-fidelity attention network to progressively process these smoothes and details in divide-and-conquer manner, which is a novel and specific prospect of image super-resolution for the purpose on improving the detail fidelity, instead of blindly designing or employing the deep CNNs architectures for merely feature representation in local receptive fields.
no code implementations • 16 Sep 2020 • Zhikang Wang, Lihuo He, Xinbo Gao, Jane Shen
The mask recalibrates the features to amplify the valuable characteristics and diminish the noise.
3 code implementations • 15 Sep 2020 • Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, Jiji C. V, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Jiangtao Zhang, Xiaotong Luo, Liang Chen, Yanyun Qu, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni
This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results.
no code implementations • 3 Sep 2020 • Lei Zhang, Zhenwei He, Yi Yang, Liang Wang, Xinbo Gao
The traditional object retrieval task aims to learn a discriminative feature representation with intra-similarity and inter-dissimilarity, which supposes that the objects in an image are manually or automatically pre-cropped exactly.
no code implementations • 3 Jul 2020 • Xinpeng Ding, Nannan Wang, Xinbo Gao, Jie Li, Xiaoyu Wang, Tongliang Liu
Specifically, we devise a partial segment loss regarded as a loss sampling to learn integral action parts from labeled segments.
Weakly-supervised Temporal Action Localization
Weakly Supervised Temporal Action Localization
no code implementations • 25 Jun 2020 • Zhenxi Zhang, Chunna Tian, Jie Li, Zhusi Zhong, Zhicheng Jiao, Xinbo Gao
Further, we propose a context encoding module to utilize the global predictor from the error map to enhance the feature representation and regularize the networks.
no code implementations • 25 May 2020 • Bing Cao, Nannan Wang, Xinbo Gao, Jie Li, Zhifeng Li
Heterogeneous face recognition (HFR) refers to matching face images acquired from different domains with wide applications in security scenarios.
no code implementations • 3 May 2020 • Kai Zhang, Shuhang Gu, Radu Timofte, Taizhang Shang, Qiuju Dai, Shengchen Zhu, Tong Yang, Yandong Guo, Younghyun Jo, Sejong Yang, Seon Joo Kim, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Jing Liu, Kwangjin Yoon, Taegyun Jeon, Kazutoshi Akita, Takeru Ooba, Norimichi Ukita, Zhipeng Luo, Yuehan Yao, Zhenyu Xu, Dongliang He, Wenhao Wu, Yukang Ding, Chao Li, Fu Li, Shilei Wen, Jianwei Li, Fuzhi Yang, Huan Yang, Jianlong Fu, Byung-Hoon Kim, JaeHyun Baek, Jong Chul Ye, Yuchen Fan, Thomas S. Huang, Junyeop Lee, Bokyeung Lee, Jungki Min, Gwantae Kim, Kanghyu Lee, Jaihyun Park, Mykola Mykhailych, Haoyu Zhong, Yukai Shi, Xiaojun Yang, Zhijing Yang, Liang Lin, Tongtong Zhao, Jinjia Peng, Huibing Wang, Zhi Jin, Jiahao Wu, Yifu Chen, Chenming Shang, Huanrong Zhang, Jeongki Min, Hrishikesh P. S, Densen Puthussery, Jiji C. V
This paper reviews the NTIRE 2020 challenge on perceptual extreme super-resolution with focus on proposed solutions and results.
no code implementations • 16 Feb 2020 • Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Xinbo Gao, Zhifeng Li
In the SR processing, we first generated a group of FACs from the input LR face, and then reconstructed the HR face from this group of FACs.
no code implementations • 15 Feb 2020 • Jingwei Xin, Nannan Wang, Jie Li, Xinbo Gao, Zhifeng Li
Current state-of-the-art CNN methods usually treat the VSR problem as a large number of separate multi-frame super-resolution tasks, at which a batch of low resolution (LR) frames is utilized to generate a single high resolution (HR) frame, and running a slide window to select LR frames over the entire video would obtain a series of HR frames.
no code implementations • 13 Feb 2020 • Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang
To achieve this goal, we present an Adaptive Time-Surface with Linear Time Decay (ATSLTD) event-to-frame conversion algorithm, which asynchronously and effectively warps the spatio-temporal information of asynchronous retinal events to a sequence of ATSLTD frames with clear object contours.
3 code implementations • 7 Feb 2020 • Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao
Besides, we devise a geometrical alignment constraint item to compensate for the pixel-based distance between prediction features and ground-truth ones.
Ranked #1 on
Facial Inpainting
on FFHQ
1 code implementation • 18 Nov 2019 • Andreas Lugmayr, Martin Danelljan, Radu Timofte, Manuel Fritsche, Shuhang Gu, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan, Nam Hyung Joon, Yu Seung Won, Guisik Kim, Dokyeong Kwon, Chih-Chung Hsu, Chia-Hsiang Lin, Yuanfei Huang, Xiaopeng Sun, Wen Lu, Jie Li, Xinbo Gao, Sefi Bell-Kligler
For training, only one set of source input images is therefore provided in the challenge.
2 code implementations • 4 Nov 2019 • Kai Zhang, Shuhang Gu, Radu Timofte, Zheng Hui, Xiumei Wang, Xinbo Gao, Dongliang Xiong, Shuai Liu, Ruipeng Gang, Nan Nan, Chenghua Li, Xueyi Zou, Ning Kang, Zhan Wang, Hang Xu, Chaofeng Wang, Zheng Li, Lin-Lin Wang, Jun Shi, Wenyu Sun, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Yazhe Niu, Peijin Zhuo, Xiangzhen Kong, Long Sun, Wenhao Wang
The challenge had 3 tracks.
4 code implementations • 26 Sep 2019 • Zheng Hui, Xinbo Gao, Yunchu Yang, Xiumei Wang
In recent years, single image super-resolution (SISR) methods using deep convolution neural network (CNN) have achieved impressive results.
Ranked #16 on
Image Super-Resolution
on Manga109 - 3x upscaling
1 code implementation • 24 Jul 2019 • Zheng Hui, Jie Li, Xinbo Gao, Xiumei Wang
In this paper, we propose a novel perceptual image super-resolution method that progressively generates visually high-quality results by constructing a stage-wise network.
Ranked #14 on
Image Super-Resolution
on BSD100 - 4x upscaling
(SSIM metric)
no code implementations • 27 Jun 2019 • Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao
In addition, we introduce a novel three-stage learning approach which enables the (cognitive) encoder to gradually distill useful knowledge from the paired (visual) encoder during the learning process.
no code implementations • 18 Jun 2019 • Zhenxi Zhang, Jie Li, Zhusi Zhong, Zhicheng Jiao, Xinbo Gao
3D image segmentation is one of the most important and ubiquitous problems in medical image processing.
no code implementations • 17 Jun 2019 • Zhusi Zhong, Jie Li, Zhenxi Zhang, Zhicheng Jiao, Xinbo Gao
We train the deep encoder-decoder for landmark detection, and combine global landmark configuration with local high-resolution feature responses.
no code implementations • 4 Apr 2019 • Cheng Deng, Zhao Li, Xinbo Gao, DaCheng Tao
In this area, extracting effective statistical characteristics from a JPEG image for classification remains a challenge.
no code implementations • 4 Apr 2019 • Cheng Deng, Zhaojia Chen, Xianglong Liu, Xinbo Gao, DaCheng Tao
Given the benefits of its low storage requirements and high retrieval efficiency, hashing has recently received increasing attention.
no code implementations • 3 Apr 2019 • Hao Wang, Cheng Deng, Xinxu Xu, Wei Liu, Xinbo Gao, DaCheng Tao
Previous works mostly focus on a generative approach that takes a highly abstract and sparse sketch as input and then synthesizes the corresponding natural image.
no code implementations • 12 Mar 2019 • Lei Zhang, Xinbo Gao
Domain is referred to as the state of the world at a certain moment.
no code implementations • 19 Dec 2018 • Xiaodan Zhang, Xinbo Gao, Wen Lu, Lihuo He
The former aims to mimic the functions of peripheral vision to encode the holistic information and provide the attended regions.
no code implementations • 28 Sep 2018 • Yanting Hu, Jie Li, Yuanfei Huang, Xinbo Gao
To capture more informative features and maintain long-term information for image super-resolution, we propose a channel-wise and spatial feature modulation (CSFM) network in which a sequence of feature-modulation memory (FMM) modules is cascaded with a densely connected structure to transform low-resolution features to high informative features.
no code implementations • 23 May 2018 • Xi Yang, Xinbo Gao, Bin Song, Nannan Wang, Dong Yang
In this paper, we aim to explore a new search method for images captured with circular fisheye lens, especially the aurora images.
1 code implementation • CVPR 2018 • Chao Li, Cheng Deng, Ning li, Wei Liu, Xinbo Gao, DaCheng Tao
In addition, we harness a self-supervised semantic network to discover high-level semantic information in the form of multi-label annotations.
2 code implementations • CVPR 2018 • Zheng Hui, Xiumei Wang, Xinbo Gao
Recently, deep convolutional neural networks (CNNs) have been demonstrated remarkable progress on single image super-resolution.
Ranked #4 on
Image Super-Resolution
on IXI
no code implementations • 24 Feb 2018 • Yanting Hu, Xinbo Gao, Jie Li, Yuanfei Huang, Hanzi Wang
To improve information flow and to capture sufficient knowledge for reconstructing the high-frequency details, we propose a cascaded multi-scale cross network (CMSC) in which a sequence of subnetworks is cascaded to infer high resolution features in a coarse-to-fine manner.
no code implementations • 28 Nov 2017 • Haoxuan You, Zhicheng Jiao, Haojun Xu, Jie Li, Ying Wang, Xinbo Gao
Generative adversarial network (GAN) has gotten wide re-search interest in the field of deep learning.
no code implementations • ICCV 2017 • Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua
We address the problem of dense visual-semantic embedding that maps not only full sentences and whole images but also phrases within sentences and salient regions within images into a multimodal embedding space.
no code implementations • 8 Jan 2017 • Nannan Wang, Xinbo Gao, Jie Li
The most time-consuming or main computation complexity for exemplar-based face sketch synthesis methods lies in the neighbor selection process.
no code implementations • 1 Jul 2016 • Chunlei Peng, Xinbo Gao, Nannan Wang, Jie Li
An adaptive sparse graphical representation scheme is designed to represent heterogeneous face images, where a Markov networks model is constructed to generate adaptive sparse vectors.
no code implementations • CVPR 2016 • Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua
To address the non-stationary property of aging patterns, age estimation can be cast as an ordinal regression problem.
no code implementations • 25 Mar 2016 • Nannan Wang, Jie Li, Leiyu Sun, Bin Song, Xinbo Gao
In this paper, we proposed a synthesized face sketch recognition framework based on full-reference image quality assessment metrics.
no code implementations • 2 Mar 2015 • Chunlei Peng, Xinbo Gao, Nannan Wang, Jie Li
Heterogeneous face recognition (HFR) refers to matching face images acquired from different sources (i. e., different sensors or different wavelengths) for identification.
no code implementations • 4 Oct 2014 • Nannan Wang, Xinbo Gao, DaCheng Tao, Xuelong. Li
CLM-based methods consist of a shape model and a number of local experts, each of which is utilized to detect a facial feature point.
no code implementations • CVPR 2014 • Zhenxing Niu, Gang Hua, Xinbo Gao, Qi Tian
In such way, we can efficiently leverage the loosely related tags, and build an intermediate level representation for a collection of weakly annotated images.
no code implementations • 1 Sep 2013 • Fei Gao, DaCheng Tao, Xinbo Gao, Xuelong. Li
The proposed BIQA method is one of learning to rank.