Search Results for author: Lei Wang

Found 352 papers, 118 papers with code

ReDro: Efficiently Learning Large-sized SPD Visual Representation

no code implementations ECCV 2020 Saimunur Rahman, Lei Wang, Changming Sun, Luping Zhou

When learning this representation in deep networks, eigen-decomposition of covariance matrix is usually needed for a key step called matrix normalisation.

Fine-Grained Image Classification

RotateCT: Knowledge Graph Embedding by Rotation and Coordinate Transformation in Complex Space

no code implementations COLING 2022 Yao Dong, Lei Wang, Ji Xiang, Xiaobo Guo, Yuqiang Xie

Knowledge graph embedding, which aims to learn representations of entities and relations in knowledge graphs, finds applications in various downstream tasks.

Computational Efficiency Knowledge Graph Embedding +3

基于预训练语言模型的案件要素识别方法(A Method for Case Factor Recognition Based on Pre-trained Language Models)

no code implementations CCL 2020 Haishun Liu, Lei Wang, Yanguang Chen, Shuchen Zhang, Yuanyuan Sun, Hongfei Lin

案件要素识别指将案件描述中重要事实描述自动抽取出来, 并根据领域专家设计的要素体系进行分类, 是智慧司法领域的重要研究内容。基于传统神经网络的文本编码难以提取深层次特征, 基于阈值的多标签分类难以捕获标签间依赖关系, 因此本文提出了基于预训练语言模型的多标签文本分类模型。该模型采用以Layer-attentive策略进行特征融合的语言模型作为编码器, 使用基于LSTM的序列生成模型作为解码器。在“CAIL2019”数据集上进行实验, 该方法比基于循环神经网络的算法在F1值上最高可提升7. 6%, 在相同超参数设置下比基础语言模型(BERT)提升约3. 2%。

Chaotic Masking Protocol for Secure Communication and Attack Detection in Remote Estimation of Cyber-Physical Systems

no code implementations14 Mar 2024 Tao Chen, Andreu Cecilia, Daniele Astolfi, Lei Wang, Zhitao Liu, Hongye Su

In remote estimation of cyber-physical systems (CPSs), sensor measurements transmitted through network may be attacked by adversaries, leading to leakage risk of privacy (e. g., the system state), and/or failure of the remote estimator.

Gradient-Aware Logit Adjustment Loss for Long-tailed Classifier

1 code implementation14 Mar 2024 Fan Zhang, Wei Qin, Weijieying Ren, Lei Wang, Zetong Chen, Richang Hong

Additionally, We find that most of the solutions to long-tailed problems are still biased towards head classes in the end, and we propose a simple and post hoc prediction re-balancing strategy to further mitigate the basis toward head class.

Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning

1 code implementation29 Feb 2024 Weijieying Ren, Xinlong Li, Lei Wang, Tianxiang Zhao, Wei Qin

Through extensive experiments, we uncover the mode connectivity phenomenon in the LLMs continual learning scenario and find that it can strike a balance between plasticity and stability.

Continual Learning Language Modelling +1

All in a Single Image: Large Multimodal Models are In-Image Learners

1 code implementation28 Feb 2024 Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim

This paper introduces a new in-context learning (ICL) mechanism called In-Image Learning (I$^2$L) that combines demonstration examples, visual cues, and instructions into a single image to enhance the capabilities of GPT-4V.

Hallucination In-Context Learning +1

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

1 code implementation27 Feb 2024 Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong Xue, Furu Wei

Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs).

Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating

no code implementations21 Feb 2024 Yifan Yanggong, Hao Pan, Lei Wang

Games are a simplified model of reality and often serve as a favored platform for Artificial Intelligence (AI) research.

Decision Making

Advancing Anomaly Detection: An Adaptation Model and a New Dataset

no code implementations7 Feb 2024 Liyun Zhu, Arjun Raj, Lei Wang

To address these challenges, we propose the Scenario-Adaptive Anomaly Detection (SA2D) method, leveraging the few-shot learning framework for faster adaptation of pre-trained models to new concepts.

Anomaly Detection Few-Shot Learning

Taylor Videos for Action Recognition

1 code implementation5 Feb 2024 Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng

Addressing these challenges, we propose the Taylor video, a new video format that highlights the dominate motions (e. g., a waving hand) in each of its frames named the Taylor frame.

Action Recognition Optical Flow Estimation

Localization of Dummy Data Injection Attacks in Power Systems Considering Incomplete Topological Information: A Spatio-Temporal Graph Wavelet Convolutional Neural Network Approach

no code implementations27 Jan 2024 Zhaoyang Qu, Yunchang Dong, Yang Li, Siqi Song, Tao Jiang, Min Li, Qiming Wang, Lei Wang, Xiaoyong Bo, Jiye Zang, Qi Xu

Unfortunately, this approach tends to overlook the inherent topological correlations within the non-Euclidean spatial attributes of power grid data, consequently leading to diminished accuracy in attack localization.

Inference Attacks Against Face Recognition Model without Classification Layers

no code implementations24 Jan 2024 Yuanqing Huang, Huilong Chen, Yinggui Wang, Lei Wang

To the best of our knowledge, the proposed attack model is the very first in the literature developed for FR models without a classification layer.

Face Recognition Generative Adversarial Network +3

A Hypernetwork Based Framework for Non-Stationary Channel Prediction

no code implementations16 Jan 2024 Guanzhang Liu, Zhengyang Hu, Lei Wang, Hongying Zhang, Jiang Xue, Michail Matthaiou

In this paper, a hypernetwork based framework is proposed for non-stationary channel prediction.

Distributed Solvers for Network Linear Equations with Scalarized Compression

no code implementations12 Jan 2024 Lei Wang, Zihao Ren, Deming Yuan, Guodong Shi

We then employ such a compressed consensus flow as a fundamental consensus subroutine to develop distributed continuous-time and discrete-time solvers for network linear equations, and prove their exponential convergence properties under scalar node communications.

Distributed Computing

YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction

1 code implementation24 Dec 2023 Xinglin Xiao, Yijie Wang, Nan Xu, Yuqi Wang, Hanxuan Yang, Minzheng Wang, Yin Luo, Lei Wang, Wenji Mao, Daniel Zeng

The difficulty of the information extraction task lies in dealing with the task-specific label schemas and heterogeneous data structures.

UIE

Gemini: A Family of Highly Capable Multimodal Models

no code implementations The Keyword 2023 Gemini Team, Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Slav Petrov, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, Fabio Viola, Malcolm Reynolds, Yuanzhong Xu, Ryan Doherty, Eli Collins, Clemens Meyer, Eliza Rutherford, Erica Moreira, Kareem Ayoub, Megha Goel, George Tucker, Enrique Piqueras, Maxim Krikun, Iain Barr, Nikolay Savinov, Ivo Danihelka, Becca Roelofs, Anaïs White, Anders Andreassen, Tamara von Glehn, Lakshman Yagati, Mehran Kazemi, Lucas Gonzalez, Misha Khalman, Jakub Sygnowski, Alexandre Frechette, Charlotte Smith, Laura Culp, Lev Proleev, Yi Luan, Xi Chen, James Lottes, Nathan Schucher, Federico Lebron, Alban Rrustemi, Natalie Clay, Phil Crone, Tomas Kocisky, Jeffrey Zhao, Bartek Perz, Dian Yu, Heidi Howard, Adam Bloniarz, Jack W. Rae, Han Lu, Laurent SIfre, Marcello Maggioni, Fred Alcober, Dan Garrette, Megan Barnes, Shantanu Thakoor, Jacob Austin, Gabriel Barth-Maron, William Wong, Rishabh Joshi, Rahma Chaabouni, Deeni Fatiha, Arun Ahuja, Ruibo Liu, Yunxuan Li, Sarah Cogan, Jeremy Chen, Chao Jia, Chenjie Gu, Qiao Zhang, Jordan Grimstad, Ale Jakse Hartman, Martin Chadwick, Gaurav Singh Tomar, Xavier Garcia, Evan Senter, Emanuel Taropa, Thanumalayan Sankaranarayana Pillai, Jacob Devlin, Michael Laskin, Diego de Las Casas, Dasha Valter, Connie Tao, Lorenzo Blanco, Adrià Puigdomènech Badia, David Reitter, Mianna Chen, Jenny Brennan, Clara Rivera, Sergey Brin, Shariq Iqbal, Gabriela Surita, Jane Labanowski, Abhi Rao, Stephanie Winkler, Emilio Parisotto, Yiming Gu, Kate Olszewska, Yujing Zhang, Ravi Addanki, Antoine Miech, Annie Louis, Laurent El Shafey, Denis Teplyashin, Geoff Brown, Elliot Catt, Nithya Attaluri, Jan Balaguer, Jackie Xiang, Pidong Wang, Zoe Ashwood, Anton Briukhov, Albert Webson, Sanjay Ganapathy, Smit Sanghavi, Ajay Kannan, Ming-Wei Chang, Axel Stjerngren, Josip Djolonga, Yuting Sun, Ankur Bapna, Matthew Aitchison, Pedram Pejman, Henryk Michalewski, Tianhe Yu, Cindy Wang, Juliette Love, Junwhan Ahn, Dawn Bloxwich, Kehang Han, Peter Humphreys, Thibault Sellam, James Bradbury, Varun Godbole, Sina Samangooei, Bogdan Damoc, Alex Kaskasoli, Sébastien M. R. Arnold, Vijay Vasudevan, Shubham Agrawal, Jason Riesa, Dmitry Lepikhin, Richard Tanburn, Srivatsan Srinivasan, Hyeontaek Lim, Sarah Hodkinson, Pranav Shyam, Johan Ferret, Steven Hand, Ankush Garg, Tom Le Paine, Jian Li, Yujia Li, Minh Giang, Alexander Neitz, Zaheer Abbas, Sarah York, Machel Reid, Elizabeth Cole, Aakanksha Chowdhery, Dipanjan Das, Dominika Rogozińska, Vitaly Nikolaev, Pablo Sprechmann, Zachary Nado, Lukas Zilka, Flavien Prost, Luheng He, Marianne Monteiro, Gaurav Mishra, Chris Welty, Josh Newlan, Dawei Jia, Miltiadis Allamanis, Clara Huiyi Hu, Raoul de Liedekerke, Justin Gilmer, Carl Saroufim, Shruti Rijhwani, Shaobo Hou, Disha Shrivastava, Anirudh Baddepudi, Alex Goldin, Adnan Ozturel, Albin Cassirer, Yunhan Xu, Daniel Sohn, Devendra Sachan, Reinald Kim Amplayo, Craig Swanson, Dessie Petrova, Shashi Narayan, Arthur Guez, Siddhartha Brahma, Jessica Landon, Miteyan Patel, Ruizhe Zhao, Kevin Villela, Luyu Wang, Wenhao Jia, Matthew Rahtz, Mai Giménez, Legg Yeung, Hanzhao Lin, James Keeling, Petko Georgiev, Diana Mincu, Boxi Wu, Salem Haykal, Rachel Saputro, Kiran Vodrahalli, James Qin, Zeynep Cankara, Abhanshu Sharma, Nick Fernando, Will Hawkins, Behnam Neyshabur, Solomon Kim, Adrian Hutter, Priyanka Agrawal, Alex Castro-Ros, George van den Driessche, Tao Wang, Shuo-Yiin Chang, Paul Komarek, Ross Mcilroy, Mario Lučić, Guodong Zhang, Wael Farhan, Michael Sharman, Paul Natsev, Paul Michel, Yong Cheng, Yamini Bansal, Siyuan Qiao, Kris Cao, Siamak Shakeri, Christina Butterfield, Justin Chung, Paul Kishan Rubenstein, Shivani Agrawal, Arthur Mensch, Kedar Soparkar, Karel Lenc, Timothy Chung, Aedan Pope, Loren Maggiore, Jackie Kay, Priya Jhakra, Shibo Wang, Joshua Maynez, Mary Phuong, Taylor Tobin, Andrea Tacchetti, Maja Trebacz, Kevin Robinson, Yash Katariya, Sebastian Riedel, Paige Bailey, Kefan Xiao, Nimesh Ghelani, Lora Aroyo, Ambrose Slone, Neil Houlsby, Xuehan Xiong, Zhen Yang, Elena Gribovskaya, Jonas Adler, Mateo Wirth, Lisa Lee, Music Li, Thais Kagohara, Jay Pavagadhi, Sophie Bridgers, Anna Bortsova, Sanjay Ghemawat, Zafarali Ahmed, Tianqi Liu, Richard Powell, Vijay Bolina, Mariko Iinuma, Polina Zablotskaia, James Besley, Da-Woon Chung, Timothy Dozat, Ramona Comanescu, Xiance Si, Jeremy Greer, Guolong Su, Martin Polacek, Raphaël Lopez Kaufman, Simon Tokumine, Hexiang Hu, Elena Buchatskaya, Yingjie Miao, Mohamed Elhawaty, Aditya Siddhant, Nenad Tomasev, Jinwei Xing, Christina Greer, Helen Miller, Shereen Ashraf, Aurko Roy, Zizhao Zhang, Ada Ma, Angelos Filos, Milos Besta, Rory Blevins, Ted Klimenko, Chih-Kuan Yeh, Soravit Changpinyo, Jiaqi Mu, Oscar Chang, Mantas Pajarskas, Carrie Muir, Vered Cohen, Charline Le Lan, Krishna Haridasan, Amit Marathe, Steven Hansen, Sholto Douglas, Rajkumar Samuel, Mingqiu Wang, Sophia Austin, Chang Lan, Jiepu Jiang, Justin Chiu, Jaime Alonso Lorenzo, Lars Lowe Sjösund, Sébastien Cevey, Zach Gleicher, Thi Avrahami, Anudhyan Boral, Hansa Srinivasan, Vittorio Selo, Rhys May, Konstantinos Aisopos, Léonard Hussenot, Livio Baldini Soares, Kate Baumli, Michael B. Chang, Adrià Recasens, Ben Caine, Alexander Pritzel, Filip Pavetic, Fabio Pardo, Anita Gergely, Justin Frye, Vinay Ramasesh, Dan Horgan, Kartikeya Badola, Nora Kassner, Subhrajit Roy, Ethan Dyer, Víctor Campos, Alex Tomala, Yunhao Tang, Dalia El Badawy, Elspeth White, Basil Mustafa, Oran Lang, Abhishek Jindal, Sharad Vikram, Zhitao Gong, Sergi Caelles, Ross Hemsley, Gregory Thornton, Fangxiaoyu Feng, Wojciech Stokowiec, Ce Zheng, Phoebe Thacker, Çağlar Ünlü, Zhishuai Zhang, Mohammad Saleh, James Svensson, Max Bileschi, Piyush Patil, Ankesh Anand, Roman Ring, Katerina Tsihlas, Arpi Vezer, Marco Selvi, Toby Shevlane, Mikel Rodriguez, Tom Kwiatkowski, Samira Daruki, Keran Rong, Allan Dafoe, Nicholas FitzGerald, Keren Gu-Lemberg, Mina Khan, Lisa Anne Hendricks, Marie Pellat, Vladimir Feinberg, James Cobon-Kerr, Tara Sainath, Maribeth Rauh, Sayed Hadi Hashemi, Richard Ives, Yana Hasson, Yaguang Li, Eric Noland, Yuan Cao, Nathan Byrd, Le Hou, Qingze Wang, Thibault Sottiaux, Michela Paganini, Jean-Baptiste Lespiau, Alexandre Moufarek, Samer Hassan, Kaushik Shivakumar, Joost van Amersfoort, Amol Mandhane, Pratik Joshi, Anirudh Goyal, Matthew Tung, Andrew Brock, Hannah Sheahan, Vedant Misra, Cheng Li, Nemanja Rakićević, Mostafa Dehghani, Fangyu Liu, Sid Mittal, Junhyuk Oh, Seb Noury, Eren Sezener, Fantine Huot, Matthew Lamm, Nicola De Cao, Charlie Chen, Gamaleldin Elsayed, Ed Chi, Mahdis Mahdieh, Ian Tenney, Nan Hua, Ivan Petrychenko, Patrick Kane, Dylan Scandinaro, Rishub Jain, Jonathan Uesato, Romina Datta, Adam Sadovsky, Oskar Bunyan, Dominik Rabiej, Shimu Wu, John Zhang, Gautam Vasudevan, Edouard Leurent, Mahmoud Alnahlawi, Ionut Georgescu, Nan Wei, Ivy Zheng, Betty Chan, Pam G Rabinovitch, Piotr Stanczyk, Ye Zhang, David Steiner, Subhajit Naskar, Michael Azzam, Matthew Johnson, Adam Paszke, Chung-Cheng Chiu, Jaume Sanchez Elias, Afroz Mohiuddin, Faizan Muhammad, Jin Miao, Andrew Lee, Nino Vieillard, Sahitya Potluri, Jane Park, Elnaz Davoodi, Jiageng Zhang, Jeff Stanway, Drew Garmon, Abhijit Karmarkar, Zhe Dong, Jong Lee, Aviral Kumar, Luowei Zhou, Jonathan Evens, William Isaac, Zhe Chen, Johnson Jia, Anselm Levskaya, Zhenkai Zhu, Chris Gorgolewski, Peter Grabowski, Yu Mao, Alberto Magni, Kaisheng Yao, Javier Snaider, Norman Casagrande, Paul Suganthan, Evan Palmer, Geoffrey Irving, Edward Loper, Manaal Faruqui, Isha Arkatkar, Nanxin Chen, Izhak Shafran, Michael Fink, Alfonso Castaño, Irene Giannoumis, Wooyeol Kim, Mikołaj Rybiński, Ashwin Sreevatsa, Jennifer Prendki, David Soergel, Adrian Goedeckemeyer, Willi Gierke, Mohsen Jafari, Meenu Gaba, Jeremy Wiesner, Diana Gage Wright, Yawen Wei, Harsha Vashisht, Yana Kulizhskaya, Jay Hoover, Maigo Le, Lu Li, Chimezie Iwuanyanwu, Lu Liu, Kevin Ramirez, Andrey Khorlin, Albert Cui, Tian Lin, Marin Georgiev, Marcus Wu, Ricardo Aguilar, Keith Pallo, Abhishek Chakladar, Alena Repina, Xihui Wu, Tom van der Weide, Priya Ponnapalli, Caroline Kaplan, Jiri Simsa, Shuangfeng Li, Olivier Dousse, Jeff Piper, Nathan Ie, Minnie Lui, Rama Pasumarthi, Nathan Lintz, Anitha Vijayakumar, Lam Nguyen Thiet, Daniel Andor, Pedro Valenzuela, Cosmin Paduraru, Daiyi Peng, Katherine Lee, Shuyuan Zhang, Somer Greene, Duc Dung Nguyen, Paula Kurylowicz, Sarmishta Velury, Sebastian Krause, Cassidy Hardin, Lucas Dixon, Lili Janzer, Kiam Choo, Ziqiang Feng, Biao Zhang, Achintya Singhal, Tejasi Latkar, Mingyang Zhang, Quoc Le, Elena Allica Abellan, Dayou Du, Dan McKinnon, Natasha Antropova, Tolga Bolukbasi, Orgad Keller, David Reid, Daniel Finchelstein, Maria Abi Raad, Remi Crocker, Peter Hawkins, Robert Dadashi, Colin Gaffney, Sid Lall, Ken Franko, Egor Filonov, Anna Bulanova, Rémi Leblond, Vikas Yadav, Shirley Chung, Harry Askham, Luis C. Cobo, Kelvin Xu, Felix Fischer, Jun Xu, Christina Sorokin, Chris Alberti, Chu-Cheng Lin, Colin Evans, Hao Zhou, Alek Dimitriev, Hannah Forbes, Dylan Banarse, Zora Tung, Jeremiah Liu, Mark Omernick, Colton Bishop, Chintu Kumar, Rachel Sterneck, Ryan Foley, Rohan Jain, Swaroop Mishra, Jiawei Xia, Taylor Bos, Geoffrey Cideron, Ehsan Amid, Francesco Piccinno, Xingyu Wang, Praseem Banzal, Petru Gurita, Hila Noga, Premal Shah, Daniel J. Mankowitz, Alex Polozov, Nate Kushman, Victoria Krakovna, Sasha Brown, Mohammadhossein Bateni, Dennis Duan, Vlad Firoiu, Meghana Thotakuri, Tom Natan, Anhad Mohananey, Matthieu Geist, Sidharth Mudgal, Sertan Girgin, Hui Li, Jiayu Ye, Ofir Roval, Reiko Tojo, Michael Kwong, James Lee-Thorp, Christopher Yew, Quan Yuan, Sumit Bagri, Danila Sinopalnikov, Sabela Ramos, John Mellor, Abhishek Sharma, Aliaksei Severyn, Jonathan Lai, Kathy Wu, Heng-Tze Cheng, David Miller, Nicolas Sonnerat, Denis Vnukov, Rory Greig, Jennifer Beattie, Emily Caveness, Libin Bai, Julian Eisenschlos, Alex Korchemniy, Tomy Tsai, Mimi Jasarevic, Weize Kong, Phuong Dao, Zeyu Zheng, Frederick Liu, Fan Yang, Rui Zhu, Mark Geller, Tian Huey Teh, Jason Sanmiya, Evgeny Gladchenko, Nejc Trdin, Andrei Sozanschi, Daniel Toyama, Evan Rosen, Sasan Tavakkol, Linting Xue, Chen Elkind, Oliver Woodman, John Carpenter, George Papamakarios, Rupert Kemp, Sushant Kafle, Tanya Grunina, Rishika Sinha, Alice Talbert, Abhimanyu Goyal, Diane Wu, Denese Owusu-Afriyie, Cosmo Du, Chloe Thornton, Jordi Pont-Tuset, Pradyumna Narayana, Jing Li, Sabaer Fatehi, John Wieting, Omar Ajmeri, Benigno Uria, Tao Zhu, Yeongil Ko, Laura Knight, Amélie Héliou, Ning Niu, Shane Gu, Chenxi Pang, Dustin Tran, Yeqing Li, Nir Levine, Ariel Stolovich, Norbert Kalb, Rebeca Santamaria-Fernandez, Sonam Goenka, Wenny Yustalim, Robin Strudel, Ali Elqursh, Balaji Lakshminarayanan, Charlie Deck, Shyam Upadhyay, Hyo Lee, Mike Dusenberry, Zonglin Li, Xuezhi Wang, Kyle Levin, Raphael Hoffmann, Dan Holtmann-Rice, Olivier Bachem, Summer Yue, Sho Arora, Eric Malmi, Daniil Mirylenka, Qijun Tan, Christy Koh, Soheil Hassas Yeganeh, Siim Põder, Steven Zheng, Francesco Pongetti, Mukarram Tariq, Yanhua Sun, Lucian Ionita, Mojtaba Seyedhosseini, Pouya Tafti, Ragha Kotikalapudi, Zhiyu Liu, Anmol Gulati, Jasmine Liu, Xinyu Ye, Bart Chrzaszcz, Lily Wang, Nikhil Sethi, Tianrun Li, Ben Brown, Shreya Singh, Wei Fan, Aaron Parisi, Joe Stanton, Chenkai Kuang, Vinod Koverkathu, Christopher A. Choquette-Choo, Yunjie Li, TJ Lu, Abe Ittycheriah, Prakash Shroff, Pei Sun, Mani Varadarajan, Sanaz Bahargam, Rob Willoughby, David Gaddy, Ishita Dasgupta, Guillaume Desjardins, Marco Cornero, Brona Robenek, Bhavishya Mittal, Ben Albrecht, Ashish Shenoy, Fedor Moiseev, Henrik Jacobsson, Alireza Ghaffarkhah, Morgane Rivière, Alanna Walton, Clément Crepy, Alicia Parrish, YuAn Liu, Zongwei Zhou, Clement Farabet, Carey Radebaugh, Praveen Srinivasan, Claudia van der Salm, Andreas Fidjeland, Salvatore Scellato, Eri Latorre-Chimoto, Hanna Klimczak-Plucińska, David Bridson, Dario de Cesare, Tom Hudson, Piermaria Mendolicchio, Lexi Walker, Alex Morris, Ivo Penchev, Matthew Mauger, Alexey Guseynov, Alison Reid, Seth Odoom, Lucia Loher, Victor Cotruta, Madhavi Yenugula, Dominik Grewe, Anastasia Petrushkina, Tom Duerig, Antonio Sanchez, Steve Yadlowsky, Amy Shen, Amir Globerson, Adam Kurzrok, Lynette Webb, Sahil Dua, Dong Li, Preethi Lahoti, Surya Bhupatiraju, Dan Hurt, Haroon Qureshi, Ananth Agarwal, Tomer Shani, Matan Eyal, Anuj Khare, Shreyas Rammohan Belle, Lei Wang, Chetan Tekur, Mihir Sanjay Kale, Jinliang Wei, Ruoxin Sang, Brennan Saeta, Tyler Liechty, Yi Sun, Yao Zhao, Stephan Lee, Pandu Nayak, Doug Fritz, Manish Reddy Vuyyuru, John Aslanides, Nidhi Vyas, Martin Wicke, Xiao Ma, Taylan Bilal, Evgenii Eltyshev, Daniel Balle, Nina Martin, Hardie Cate, James Manyika, Keyvan Amiri, Yelin Kim, Xi Xiong, Kai Kang, Florian Luisier, Nilesh Tripuraneni, David Madras, Mandy Guo, Austin Waters, Oliver Wang, Joshua Ainslie, Jason Baldridge, Han Zhang, Garima Pruthi, Jakob Bauer, Feng Yang, Riham Mansour, Jason Gelman, Yang Xu, George Polovets, Ji Liu, Honglong Cai, Warren Chen, XiangHai Sheng, Emily Xue, Sherjil Ozair, Adams Yu, Christof Angermueller, Xiaowei Li, Weiren Wang, Julia Wiesinger, Emmanouil Koukoumidis, Yuan Tian, Anand Iyer, Madhu Gurumurthy, Mark Goldenson, Parashar Shah, MK Blake, Hongkun Yu, Anthony Urbanowicz, Jennimaria Palomaki, Chrisantha Fernando, Kevin Brooks, Ken Durden, Harsh Mehta, Nikola Momchev, Elahe Rahimtoroghi, Maria Georgaki, Amit Raul, Sebastian Ruder, Morgan Redshaw, Jinhyuk Lee, Komal Jalan, Dinghua Li, Ginger Perng, Blake Hechtman, Parker Schuh, Milad Nasr, Mia Chen, Kieran Milan, Vladimir Mikulik, Trevor Strohman, Juliana Franco, Tim Green, Demis Hassabis, Koray Kavukcuoglu, Jeffrey Dean, Oriol Vinyals

This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding.

Arithmetic Reasoning Code Generation +3

Roll With the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning

1 code implementation19 Dec 2023 Yue Duan, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

While semi-supervised learning (SSL) has yielded promising results, the more realistic SSL scenario remains to be explored, in which the unlabeled data exhibits extremely high recognition difficulty, e. g., fine-grained visual classification in the context of SSL (SS-FGVC).

Fine-Grained Image Classification Pseudo Label

Federated Learning with Instance-Dependent Noisy Label

no code implementations16 Dec 2023 Lei Wang, Jieming Bian, Jie Xu

We introduce a novel algorithm called FedBeat (Federated Learning with Bayesian Ensemble-Assisted Transition Matrix Estimation).

Federated Learning

Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

1 code implementation4 Dec 2023 Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim

The fine-grained object attributes and behaviors non-existent in the image may still be generated but not measured by the current evaluation methods.

Hallucination Hallucination Evaluation +2

Learning with Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias Calibration

no code implementations27 Nov 2023 Lei Wang, Qingbo Wu, Desen Yuan, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu

Learning based image quality assessment (IQA) models have obtained impressive performance with the help of reliable subjective quality labels, where mean opinion score (MOS) is the most popular choice.

Image Quality Assessment

SpliceMix: A Cross-scale and Semantic Blending Augmentation Strategy for Multi-label Image Classification

1 code implementation26 Nov 2023 Lei Wang, Yibing Zhan, Leilei Ma, Dapeng Tao, Liang Ding, Chen Gong

The "splice" in our method is two-fold: 1) Each mixed image is a splice of several downsampled images in the form of a grid, where the semantics of images attending to mixing are blended without object deficiencies for alleviating co-occurred bias; 2) We splice mixed images and the original mini-batch to form a new SpliceMixed mini-batch, which allows an image with different scales to contribute to training together.

Data Augmentation Multi-Label Image Classification

HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition

no code implementations19 Nov 2023 Lei Wang, Yinchi Ma, Peng Luan, Wei Yao, CongCong Li, Bo Liu

Gait recognition has achieved promising advances in controlled settings, yet it significantly struggles in unconstrained environments due to challenges such as view changes, occlusions, and varying walking speeds.

Gait Recognition

CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

no code implementations6 Nov 2023 Jieming Bian, Lei Wang, Shaolei Ren, Jie Xu

Training large-scale artificial intelligence (AI) models demands significant computational power and energy, leading to increased carbon footprint with potential environmental repercussions.

Federated Learning

A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis

no code implementations31 Oct 2023 Yingshu Li, Yunyi Liu, Zhanyu Wang, Xinyu Liang, Lei Wang, Lingqiao Liu, Leyang Cui, Zhaopeng Tu, Longyue Wang, Luping Zhou

This work conducts an evaluation of GPT-4V's multimodal capability for medical image analysis, with a focus on three representative tasks of radiology report generation, medical visual question answering, and medical visual grounding.

Descriptive Medical Visual Question Answering +3

R$^3$ Prompting: Review, Rephrase and Resolve for Chain-of-Thought Reasoning in Large Language Models under Noisy Context

no code implementations25 Oct 2023 Qingyuan Tian, Hanlun Zhu, Lei Wang, Yang Li, Yunshi Lan

More analyses and ablation studies show the robustness and generalization of R$^3$ prompting method in solving reasoning tasks in LLMs under noisy context.

Sentence

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

1 code implementation23 Oct 2023 Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang

To achieve this goal, we adopt Avalon, a representative communication game, as the environment and use system prompts to guide LLM agents to play the game.

Distributed Adaptive Time-Varying Convex Optimization for Multi-agent Systems

no code implementations20 Oct 2023 Liangze Jiang, Zhengguang Wu, Lei Wang

A new class of adaptive algorithms are proposed to solve time-varying convex optimization problems.

Flow Dynamics Correction for Action Recognition

no code implementations16 Oct 2023 Lei Wang, Piotr Koniusz

Various research studies indicate that action recognition performance highly depends on the types of motions being extracted and how accurate the human actions are represented.

Fine-grained Action Recognition Hallucination +1

Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation

no code implementations12 Oct 2023 Yuanyuan Liang, Jianing Wang, Hanlun Zhu, Lei Wang, Weining Qian, Yunshi Lan

Inspired by Chain-of-Thought (CoT) prompting, which is an in-context learning strategy for reasoning, we formulate KBQG task as a reasoning problem, where the generation of a complete question is splitted into a series of sub-question generation.

In-Context Learning Question Generation +1

LLM4Vis: Explainable Visualization Recommendation using ChatGPT

1 code implementation11 Oct 2023 Lei Wang, Songheng Zhang, Yun Wang, Ee-Peng Lim, Yong Wang

To obtain demonstration examples with high-quality explanations, we propose a new explanation generation bootstrapping to iteratively refine generated explanations by considering the previous generation and template-based hint.

Data Visualization Explanation Generation

Adaptive Multi-head Contrastive Learning

no code implementations9 Oct 2023 Lei Wang, Piotr Koniusz, Tom Gedeon, Liang Zheng

As such, enforcing a high similarity for positive pairs and a low similarity for negative pairs may not always be achievable, and in the case of some pairs, forcing so may be detrimental to the performance.

Contrastive Learning

Reach-avoid Analysis for Sampled-data Systems with Measurement Uncertainties

no code implementations8 Oct 2023 Taoran Wu, Dejin Ren, Shuyuan Zhang, Lei Wang, Bai Xue

Digital control has become increasingly prevalent in modern systems, making continuous-time plants controlled by discrete-time (digital) controllers ubiquitous and crucial across industries, including aerospace, automotive, and manufacturing.

AI in Software Engineering: Case Studies and Prospects

no code implementations27 Sep 2023 Lei Wang

Based on the analysis of both case studies, using AI techniques such as deep learning and machine learning in software systems contributes to intelligent systems.

Decision Making

R2GenGPT: Radiology Report Generation with Frozen LLMs

1 code implementation18 Sep 2023 Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou

First, it attains state-of-the-art (SOTA) performance by training only the lightweight visual alignment module while freezing all the parameters of LLM.

Differentially Private Average Consensus with Improved Accuracy-Privacy Trade-off

no code implementations15 Sep 2023 Lei Wang, Weijia Liu, Fanghong Guo, Zixin Qiao, Zhengguang Wu

Gaussian noise and the output of the mechanism using Gaussian noises, it is shown that the resulting average consensus algorithm can eliminate the gap in the sense that the accuracy-privacy trade-off of the centralized averaging approach with differential privacy can be almost recovered by appropriately designing the variances of the added noises.

Beamforming Design and Performance Evaluation for RIS-aided Localization using LEO Satellite Signals

no code implementations13 Sep 2023 Lei Wang, Pinjun Zheng, Xing Liu, Tarig Ballal, Tareq Y. Al-Naffouri

The growing availability of low-Earth orbit (LEO) satellites, coupled with the anticipated widespread deployment of reconfigurable intelligent surfaces (RISs), opens up promising prospects for new localization paradigms.

Enhancing Sample Utilization through Sample Adaptive Augmentation in Semi-Supervised Learning

1 code implementation ICCV 2023 Guan Gui, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

Sample adaptive augmentation (SAA) is proposed for this stated purpose and consists of two modules: 1) sample selection module; 2) sample augmentation module.

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors

1 code implementation26 Aug 2023 Chengkun Wei, Wenlong Meng, Zhikun Zhang, Min Chen, Minghu Zhao, Wenjing Fang, Lei Wang, Zihui Zhang, Wenzhi Chen

Instead of directly inverting the triggers, LMSanitator aims to invert the predefined attack vectors (pretrained models' output when the input is embedded with triggers) of the task-agnostic backdoors, which achieves much better convergence performance and backdoor detection accuracy.

Head-Tail Cooperative Learning Network for Unbiased Scene Graph Generation

1 code implementation23 Aug 2023 Lei Wang, Zejian yuan, Yao Lu, Badong Chen

We also propose a self-supervised learning approach to enhance the prediction ability of the tail-prefer feature representation branch by constraining tail-prefer predicate features.

Graph Generation Self-Supervised Learning +1

A Survey on Large Language Model based Autonomous Agents

2 code implementations22 Aug 2023 Lei Wang, Chen Ma, Xueyang Feng, Zeyu Zhang, Hao Yang, Jingsen Zhang, ZhiYuan Chen, Jiakai Tang, Xu Chen, Yankai Lin, Wayne Xin Zhao, Zhewei Wei, Ji-Rong Wen

In this paper, we present a comprehensive survey of these studies, delivering a systematic review of the field of LLM-based autonomous agents from a holistic perspective.

Language Modelling Large Language Model

MoCoSA: Momentum Contrast for Knowledge Graph Completion with Structure-Augmented Pre-trained Language Models

no code implementations16 Aug 2023 Jiabang He, Liu Jia, Lei Wang, Xiyao Li, Xing Xu

However, they struggle with semantically rich real-world entities due to limited structural information and fail to generalize to unseen entities.

Entity Embeddings Link Prediction

Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields

no code implementations11 Aug 2023 Yatao Li, Wanling Gao, Lei Wang, Lixin Sun, Zun Wang, Jianfeng Zhan

This suite of metrics has demonstrated a better ability to assess a model's performance in real-world scientific applications, in contrast to traditional AI benchmarking methodologies.

Benchmarking

Empower Your Model with Longer and Better Context Comprehension

1 code implementation25 Jul 2023 YiFei Gao, Lei Wang, Jun Fang, Longhua Hu, Jun Cheng

Recently, with the emergence of numerous Large Language Models (LLMs), the implementation of AI has entered a new era.

Hierarchical Spatio-Temporal Representation Learning for Gait Recognition

no code implementations ICCV 2023 Lei Wang, Bo Liu, Fangfang Liang, Bincheng Wang

While current methods focus on exploiting body part-based representations, they often neglect the hierarchical dependencies between local motion patterns.

Gait Recognition Representation Learning

Semantic-Aware Dual Contrastive Learning for Multi-label Image Classification

1 code implementation19 Jul 2023 Leilei Ma, Dengdi Sun, Lei Wang, Haifeng Zhao, Bin Luo

Specifically, we leverage semantic-aware representation learning to extract category-related local discriminative features and construct category prototypes.

Contrastive Learning Multi-Label Image Classification +2

IterLara: A Turing Complete Algebra for Big Data, AI, Scientific Computing, and Database

no code implementations17 Jul 2023 Hongxiao Li, Wanling Gao, Lei Wang, Jianfeng Zhan

The study of \textsc{Lara}'s expressive ability reports that it can represent relational algebra and most linear algebra operations.

In-context Autoencoder for Context Compression in a Large Language Model

1 code implementation13 Jul 2023 Tao Ge, Jing Hu, Lei Wang, Xun Wang, Si-Qing Chen, Furu Wei

We propose the In-context Autoencoder (ICAE), leveraging the power of a large language models (LLM) to compress a long context into short compact memory slots that can be directly conditioned on by the LLM for various purposes.

Language Modelling Large Language Model +3

An Empirical Study on the Holiday Effect of China's Time-Honored Companies

no code implementations29 Jun 2023 Xianyang Li, Jiayi Xu, Haoxuan Xu, Yunxuan Ma, Yu Zhong, Lei Wang

The stock segment of China's time-honored brand enterprises has an important position in our securities stock market.

PEBO-SLAM: Observer design for visual inertial SLAM with convergence guarantees

no code implementations22 Jun 2023 Bowen Yi, Chi Jin, Lei Wang, Guodong Shi, Viorela Ila, Ian R. Manchester

This paper introduces a new linear parameterization to the problem of visual inertial simultaneous localization and mapping (VI-SLAM) -- without any approximation -- for the case only using information from a single monocular camera and an inertial measurement unit.

Simultaneous Localization and Mapping

D3L: Decomposition of 3D Rotation and Lift from 2D Joint to 3D for Human Mesh Recovery

no code implementations10 Jun 2023 Xiaoyang Hao, Han Li, Jun Cheng, Lei Wang

However, these methods present rotation semantic ambiguity, rotation error accumulation, and shape estimation overfitting, which also leads to errors in the estimated pose.

Human Mesh Recovery Pose Estimation +1

Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models

1 code implementation5 Jun 2023 Jiabang He, Yi Hu, Lei Wang, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen

Results from the experiments demonstrate that there is a significant performance gap between the in-distribution (ID) and OOD settings for document images, and that fine-grained analysis of distribution shifts can reveal the brittle nature of existing pre-trained VDU models and OOD generalization algorithms.

document understanding Question Answering

User Behavior Simulation with Large Language Model based Agents

1 code implementation5 Jun 2023 Lei Wang, Jingsen Zhang, Hao Yang, ZhiYuan Chen, Jiakai Tang, Zeyu Zhang, Xu Chen, Yankai Lin, Ruihua Song, Wayne Xin Zhao, Jun Xu, Zhicheng Dou, Jun Wang, Ji-Rong Wen

Simulating high quality user behavior data has always been a fundamental problem in human-centered applications, where the major difficulty originates from the intricate mechanism of human decision process.

Language Modelling Large Language Model +2

SAPI: Surroundings-Aware Vehicle Trajectory Prediction at Intersections

no code implementations2 Jun 2023 Ethan Zhang, Hao Xiao, Yiqian Gan, Lei Wang

In this work we propose a deep learning model, i. e., SAPI, to predict vehicle trajectories at intersections.

Autonomous Vehicles Trajectory Prediction

ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection

no code implementations CVPR 2023 Shiwei Jin, Zhen Wang, Lei Wang, Ning Bi, Truong Nguyen

Then both the initial and edited embeddings are projected back (deprojected) to the initial latent space as residuals to modify the input latent vectors by subtraction and addition, representing old status removal and new status addition.

Attribute Gaze Estimation +2

MALM: Mask Augmentation based Local Matching for Food-Recipe Retrieval

1 code implementation18 May 2023 Bhanu Prakash Voutharoja, Peng Wang, Lei Wang, Vivienne Guan

A de-facto idea to address this task is to learn a shared feature embedding space in which a food image is aligned better to its paired recipe than other recipes.

Image-text matching Retrieval +1

Automatic Radiology Report Generation by Learning with Increasingly Hard Negatives

1 code implementation11 May 2023 Bhanu Prakash Voutharoja, Lei Wang, Luping Zhou

At each iteration, conditioned on a given set of hard negative reports, image and report features are learned as usual by minimising the loss functions related to report generation.

Medical Report Generation

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure

1 code implementation8 May 2023 Yi Bin, Mengqun Han, Wenhao Shi, Lei Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen

For evaluating the possible expression variants, we design a path-based metric to evaluate the partial accuracy of expressions of a unified tree.

Math valid

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

3 code implementations6 May 2023 Lei Wang, Wanyu Xu, Yihuai Lan, Zhiqiang Hu, Yunshi Lan, Roy Ka-Wei Lee, Ee-Peng Lim

To address the calculation errors and improve the quality of generated reasoning steps, we extend PS prompting with more detailed instructions and derive PS+ prompting.

Math

Stars Are All You Need: A Distantly Supervised Pyramid Network for Unified Sentiment Analysis

no code implementations2 May 2023 Wenchang Li, Yixing Chen, Shuang Zheng, Lei Wang, John P. Lalor

We also demonstrate the interpretability of DSPN's outputs on reviews to show the pyramid structure inherent in unified sentiment analysis.

Aspect Category Detection Sentiment Analysis

Learning Partial Correlation based Deep Visual Representation for Image Classification

1 code implementation CVPR 2023 Saimunur Rahman, Piotr Koniusz, Lei Wang, Luping Zhou, Peyman Moghadam, Changming Sun

Our work obtains a partial correlation based deep visual representation and mitigates the small sample problem often encountered by covariance matrix estimation in CNN.

Fine-Grained Image Classification

Accelerating Hybrid Federated Learning Convergence under Partial Participation

no code implementations10 Apr 2023 Jieming Bian, Lei Wang, Kun Yang, Cong Shen, Jie Xu

In this paper, we provide theoretical analysis of hybrid FL under clients' partial participation to validate that partial participation is the key constraint on convergence speed.

Federated Learning

Zero-Shot Next-Item Recommendation using Large Pretrained Language Models

1 code implementation6 Apr 2023 Lei Wang, Ee-Peng Lim

Large language models (LLMs) have achieved impressive zero-shot performance in various natural language processing (NLP) tasks, demonstrating their capabilities for inference without training examples.

Sequential Recommendation

METransformer: Radiology Report Generation by Transformer with Multiple Learnable Expert Tokens

no code implementations CVPR 2023 Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou

In the encoder, each expert token interacts with both vision tokens and other expert tokens to learn to attend different image regions for image representation.

LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models

2 code implementations4 Apr 2023 Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee-Peng Lim, Lidong Bing, Xing Xu, Soujanya Poria, Roy Ka-Wei Lee

The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e. g., ChatDoctor) or instruction data (e. g., Alpaca).

Arithmetic Reasoning Language Modelling

3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition

no code implementations CVPR 2023 Lei Wang, Piotr Koniusz

We split action sequences into temporal blocks, Higher-order Transformer (HoT) produces embeddings of each temporal block based on (i) the body joints, (ii) pairwise links of body joints and (iii) higher-order hyper-edges of skeleton body joints.

Action Recognition Skeleton Based Action Recognition

Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs

no code implementations24 Mar 2023 Taiqiang Wu, Zhe Zhao, Jiahao Wang, Xingyu Bai, Lei Wang, Ngai Wong, Yujiu Yang

Distilling high-accuracy Graph Neural Networks~(GNNs) to low-latency multilayer perceptrons~(MLPs) on graph tasks has become a hot research topic.

Knowledge Distillation

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction

1 code implementation ICCV 2023 Jiabang He, Lei Wang, Yi Hu, Ning Liu, Hui Liu, Xing Xu, Heng Tao Shen

To this end, we propose a simple but effective in-context learning framework called ICL-D3IE, which enables LLMs to perform DIE with different types of demonstration examples.

Document AI In-Context Learning

MVFusion: Multi-View 3D Object Detection with Semantic-aligned Radar and Camera Fusion

no code implementations21 Feb 2023 Zizhang Wu, Guilian Chen, Yuanzhu Gan, Lei Wang, Jian Pu

To achieve so, we inject the semantic alignment into the radar features via the semantic-aligned radar encoder (SARE) to produce image-guided radar features.

3D Object Detection Autonomous Driving +1

CMLCompiler: A Unified Compiler for Classical Machine Learning

no code implementations31 Jan 2023 Xu Wen, Wanling Gao, Anzheng Li, Lei Wang, Zihan Jiang, Jianfeng Zhan

Without a unified framework, the hybrid deployments of deep learning (DL) and CML also suffer from severe performance and portability issues.

High-level semantic feature matters few-shot unsupervised domain adaptation

no code implementations5 Jan 2023 Lei Yu, Wanqi Yang, Shengqi Huang, Lei Wang, Ming Yang

However, the goal of FS-UDA and FSL are relevant yet distinct, since FS-UDA aims to classify the samples in target domain rather than source domain.

Few-Shot Learning Unsupervised Domain Adaptation +1

A GOA-Based Fault-Tolerant Trajectory Tracking Control for an Underwater Vehicle of Multi-Thruster System without Actuator Saturation

no code implementations4 Jan 2023 Danjie Zhu, Lei Wang, Hua Zhang, Simon X. Yang

This paper proposes an intelligent fault-tolerant control (FTC) strategy to tackle the trajectory tracking problem of an underwater vehicle (UV) under thruster damage (power loss) cases and meanwhile resolve the actuator saturation brought by the vehicle's physical constraints.

Learning Spatial-context-aware Global Visual Feature Representation for Instance Image Retrieval

1 code implementation ICCV 2023 Zhongyan Zhang, Lei Wang, Luping Zhou, Piotr Koniusz

To this end, we propose a novel feature learning framework for instance image retrieval, which embeds local spatial context information into the learned global feature representations.

Image Retrieval Retrieval

Regularized Primitive Graph Learning for Unified Vector Mapping

no code implementations ICCV 2023 Lei Wang, Min Dai, Jianan He, Jingwei Huang

Our key idea is using primitive graph as a unified representation of vector maps and formulating shape regularization and topology reconstruction as primitive graph reconstruction problems that can be solved in the same framework.

Graph Learning Graph Reconstruction

Quality at the Tail of Machine Learning Inference

no code implementations25 Dec 2022 Zhengxin Yang, Wanling Gao, Chunjie Luo, Lei Wang, Fei Tang, Xu Wen, Jianfeng Zhan

The study unveils a counterintuitive revelation: deep learning inference quality exhibits fluctuations due to inference time.

Autonomous Driving Benchmarking +1

a cognitive frequency allocation strategy for multi-carrier radar against communication interference

no code implementations23 Dec 2022 Zhao Shan, Lei Wang, PengFei Liu, Tianyao Huang, Yimin Liu

To address this challenge, we use a novel iteratively selecting technique which breaks a difficult decision task into several easy tasks.

ToL: A Tensor of List-Based Unified Computation Model

no code implementations21 Dec 2022 Hongxiao Li, Wanling Gao, Lei Wang, Jianfeng Zhan

This article presents a unified computation model with generalized expression ability and a concise set of primitive operators for programming high-level algorithms.

Generalizing Math Word Problem Solvers via Solution Diversification

1 code implementation1 Dec 2022 Zhenwen Liang, Jipeng Zhang, Lei Wang, Yan Wang, Jie Shao, Xiangliang Zhang

In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator.

Math

Recent Advances in RecBole: Extensions with more Practical Considerations

1 code implementation28 Nov 2022 Lanling Xu, Zhen Tian, Gaowei Zhang, Lei Wang, Junjie Zhang, Bowen Zheng, YiFan Li, Yupeng Hou, Xingyu Pan, Yushuo Chen, Wayne Xin Zhao, Xu Chen, Ji-Rong Wen

In order to show the recent update in RecBole, we write this technical report to introduce our latest improvements on RecBole.

Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models

1 code implementation27 Nov 2022 Lei Wang, Jiabang He, Xing Xu, Ning Liu, Hui Liu

In this paper, we propose a new model architecture with alignment-enriched tuning (dubbed AETNet) upon pre-trained document image models, to adapt downstream tasks with the joint task-specific supervised and alignment-aware contrastive objective.

A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix

no code implementations26 Nov 2022 Wenbin Li, Meihao Kong, Xuesong Yang, Lei Wang, Jing Huo, Yang Gao, Jiebo Luo

In this study, we present a new unified contrastive learning representation framework (named UniCLR) suitable for all the above four kinds of methods from a novel perspective of basic affinity matrix.

Contrastive Learning Representation Learning

Explainable and Safe Reinforcement Learning for Autonomous Air Mobility

1 code implementation24 Nov 2022 Lei Wang, Hongyu Yang, Yi Lin, Suwan Yin, Yuankai Wu

Although DRL has achieved important advancements in this field, the existing works pay little attention to the explainability and safety issues related to DRL controllers, particularly the safety under adversarial attacks.

Adversarial Attack Q-Learning +3

A Review of Intelligent Music Generation Systems

no code implementations16 Nov 2022 Lei Wang, Ziyi Zhao, Hanwei Liu, Junwei Pang, Yi Qin, Qidi Wu

With the introduction of ChatGPT, the public's perception of AI-generated content (AIGC) has begun to reshape.

Benchmarking Music Generation

Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction

no code implementations15 Nov 2022 Leilei Gan, Baokui Li, Kun Kuang, Yating Zhang, Lei Wang, Luu Anh Tuan, Yi Yang, Fei Wu

Given the fact description text of a legal case, legal judgment prediction (LJP) aims to predict the case's charge, law article and penalty term.

Contrastive Learning

Mitigating Popularity Bias in Recommendation with Unbalanced Interactions: A Gradient Perspective

no code implementations31 Oct 2022 Weijieying Ren, Lei Wang, Kunpeng Liu, Ruocheng Guo, Lim Ee Peng, Yanjie Fu

We present a gradient perspective to understand two negative impacts of popularity bias in recommendation model optimization: (i) the gradient direction of popular item embeddings is closer to that of positive interactions, and (ii) the magnitude of positive gradient for popular items are much greater than that of unpopular items.

Model Optimization Recommendation Systems

Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot Object Detection

1 code implementation30 Oct 2022 Shan Zhang, Naila Murray, Lei Wang, Piotr Koniusz

To address these drawbacks, we propose a Time-rEversed diffusioN tEnsor Transformer (TENET), which i) forms high-order tensor representations that capture multi-way feature occurrences that are highly discriminative, and ii) uses a transformer that dynamically extracts correlations between the query image and the entire support set, instead of a single average-pooled support embedding.

Few-Shot Object Detection Object +1

Uncertainty-DTW for Time Series and Sequences

1 code implementation30 Oct 2022 Lei Wang, Piotr Koniusz

Dynamic Time Warping (DTW) is used for matching pairs of sequences and celebrated in applications such as forecasting the evolution of time series, clustering time series or even matching sequence pairs in few-shot action recognition.

Dynamic Time Warping Few-Shot action recognition +3

Temporal-Viewpoint Transportation Plan for Skeletal Few-shot Action Recognition

no code implementations30 Oct 2022 Lei Wang, Piotr Koniusz

To factor out misalignment between query and support sequences of 3D body joints, we propose an advanced variant of Dynamic Time Warping which jointly models each smooth path between the query and support frames to achieve simultaneously the best alignment in the temporal and simulated camera viewpoint spaces for end-to-end learning under the limited few-shot training data.

Dynamic Time Warping Few-Shot action recognition +3

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

1 code implementation27 Oct 2022 Bowen Shen, Zheng Lin, Yuanxin Liu, Zhengxiao Liu, Lei Wang, Weiping Wang

Motivated by such considerations, we propose a collaborative optimization for PLMs that integrates static model compression and dynamic inference acceleration.

Model Compression

Recommendation with User Active Disclosing Willingness

no code implementations25 Oct 2022 Lei Wang, Xu Chen, Quanyu Dai, Zhenhua Dong

Recommender system has been deployed in a large amount of real-world applications, profoundly influencing people's daily life and production. Traditional recommender models mostly collect as comprehensive as possible user behaviors for accurate preference estimation.

Recommendation Systems

TPU-MLIR: A Compiler For TPU Using MLIR

1 code implementation23 Oct 2022 Pengchao Hu, Man Lu, Lei Wang, Guoyue Jiang

Multi-level intermediate representations (MLIR) show great promise for reducing the cost of building domain-specific compilers by providing a reusable and extensible compiler infrastructure.

S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention

1 code implementation22 Oct 2022 Chiyu Zhang, Xiaogang Xu, Lei Wang, Zaiyan Dai, Jun Yang

Transformer's recent integration into style transfer leverages its proficiency in establishing long-range dependencies, albeit at the expense of attenuated local modeling.

Style Transfer

Synthetic Voice Detection and Audio Splicing Detection using SE-Res2Net-Conformer Architecture

no code implementations7 Oct 2022 Lei Wang, Benedict Yeoh, Jun Wah Ng

Synthetic voice and splicing audio clips have been generated to spoof Internet users and artificial intelligence (AI) technologies such as voice authentication.

Binary Classification

Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection

1 code implementation Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2022 Jun Yu, Liwen Zhang, Shenshen Du, Hao Chang, Keda Lu, Zhong Zhang, Ye Yu, Lei Wang, Qiang Ling

To overcome these difficulties, this paper first select fewer but suitable data augmentation methods to improve the accuracy of the supervised model based on the labeled training set, which is suitable for the characteristics of hyperspectral images.

Data Augmentation object-detection +3

On Embeddings and Inverse Embeddings of Input Design for Regularized System Identification

no code implementations27 Sep 2022 Biqiang Mu, Tianshi Chen, He Kong, Bo Jiang, Lei Wang, Junfeng Wu

For the emerging regularized system identification, the study on input design has just started, and it is often formulated as a non-convex optimization problem that minimizes a scalar measure of the Bayesian mean squared error matrix subject to certain constraints, and the state-of-art method is the so-called quadratic mapping and inverse embedding (QMIE) method, where a time domain inverse embedding (TDIE) is proposed to find the inverse of the quadratic mapping.

GaitMM: Multi-Granularity Motion Sequence Learning for Gait Recognition

1 code implementation18 Sep 2022 Lei Wang, Bo Liu, Bincheng Wang, Fuqiang Yu

In this study, we propose a multi-granularity motion representation network (GaitMM) for gait sequence learning.

Gait Recognition

Deep Variational Free Energy Approach to Dense Hydrogen

1 code implementation13 Sep 2022 Hao Xie, Zi-Hang Li, Han Wang, Linfeng Zhang, Lei Wang

We developed a deep generative model-based variational free energy approach to the equations of state of dense hydrogen.

Explanation Guided Contrastive Learning for Sequential Recommendation

1 code implementation3 Sep 2022 Lei Wang, Ee-Peng Lim, Zhiwei Liu, Tianxiang Zhao

Recently, contrastive learning has been applied to the sequential recommendation task to address data sparsity caused by users with few item interactions and items with few user adoptions.

Contrastive Learning Representation Learning +1

Improving Compositional Generalization in Math Word Problem Solving

1 code implementation3 Sep 2022 Yunshi Lan, Lei Wang, Jing Jiang, Ee-Peng Lim

To improve the compositional generalization in MWP solving, we propose an iterative data augmentation method that includes diverse compositional variation into training data and could collaborate with MWP methods.

Data Augmentation Math +1

A Medical Semantic-Assisted Transformer for Radiographic Report Generation

no code implementations22 Aug 2022 Zhanyu Wang, Mingkang Tang, Lei Wang, Xiu Li, Luping Zhou

Automated radiographic report generation is a challenging cross-domain task that aims to automatically generate accurate and semantic-coherence reports to describe medical images.

Image Captioning Medical Report Generation

Private, Efficient, and Accurate: Protecting Models Trained by Multi-party Learning with Differential Privacy

no code implementations18 Aug 2022 Wenqiang Ruan, Mingxin Xu, Wenjing Fang, Li Wang, Lei Wang, Weili Han

Second, to reduce the accuracy loss led by differential privacy noise and the huge communication overhead of MPL, we propose two optimization methods for the training process of MPL: (1) the data-independent feature extraction method, which aims to simplify the trained model structure; (2) the local data-based global model initialization method, which aims to speed up the convergence of the model training.

Instance Image Retrieval by Learning Purely From Within the Dataset

no code implementations12 Aug 2022 Zhongyan Zhang, Lei Wang, Yang Wang, Luping Zhou, Jianjia Zhang, Peng Wang, Fang Chen

Although achieving promising results, this approach is restricted by two issues: 1) the domain gap between benchmark datasets and the dataset of a given retrieval task; 2) the required auxiliary dataset cannot be readily obtained.

Image Retrieval Retrieval +2

RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning

3 code implementations9 Aug 2022 Yue Duan, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi

In this work, we propose Reciprocal Distribution Alignment (RDA) to address semi-supervised learning (SSL), which is a hyperparameter-free framework that is independent of confidence threshold and works with both the matched (conventionally) and the mismatched class distributions.

Semi-Supervised Image Classification

Primitive Graph Learning for Unified Vector Mapping

no code implementations28 Jun 2022 Lei Wang, Min Dai, Jianan He, Jingwei Huang, Mingwei Sun

Then, we convert vector shape prediction, regularization, and topology reconstruction into a unique primitive graph learning problem.

Graph Learning

Attitude estimation from vector measurements: Necessary and sufficient conditions and convergent observer design

no code implementations27 Jun 2022 Bowen Yi, Lei Wang, Ian R. Manchester

The paper addresses the problem of attitude estimation for rigid bodies using (possibly time-varying) vector measurements, for which we provide a necessary and sufficient condition of distinguishability.

Constructing Cross-lingual Consumer Health Vocabulary with Word-Embedding from Comparable User Generated Content

no code implementations23 Jun 2022 Chia-Hsuan Chang, Lei Wang, Christopher C. Yang

To analyze the health consumer-generated content (HCGC) from the OHCs, identifying the colloquial medical expressions used by laypeople is a critical challenge.

TC-SfM: Robust Track-Community-Based Structure-from-Motion

no code implementations13 Jun 2022 Lei Wang, Linlin Ge, Shan Luo, Zihan Yan, Zhaopeng Cui, Jieqing Feng

Specifically, a novel structure is proposed, namely, {\textit{track-community}}, in which each community consists of a group of tracks and represents a local segment in the scene.

Community Detection

Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification

no code implementations31 May 2022 Wenshuo Zhou, Dalu Yang, Binghong Wu, Yehui Yang, Junde Wu, Xiaorong Wang, Lei Wang, Haifeng Huang, Yanwu Xu

Deep learning based medical imaging classification models usually suffer from the domain shift problem, where the classification performance drops when training data and real-world data differ in imaging equipment manufacturer, image acquisition protocol, patient populations, etc.

domain classification Domain Generalization +3

Fast and Arbitrary Beam Pattern Design for RIS-Assisted Terahertz Wireless Communication

no code implementations6 May 2022 Jian Dang, Zaichen Zhang, Yewei Li, Liang Wu, Bingcheng Zhu, Lei Wang

Reconfigurable intelligent surface (RIS) can assist terahertz wireless communication to restore the fragile line-of-sight links and facilitate beam steering.

Revealing the CO2 emission reduction of ridesplitting and its determinants based on real-world data

no code implementations2 Apr 2022 Wenxiang Li, Yuanyuan Li, Ziyuan Pu, Long Cheng, Lei Wang, Linchuan Yang

Integrating the trip data with the COPERT model, this study calculates the CO2 emissions of shared rides (ridesplitting) and their substituted single rides (regular ridesourcing) to estimate the CO2 emission reduction of each ridesplitting trip.

Interpretable Machine Learning

MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization

3 code implementations27 Mar 2022 Yue Duan, Zhen Zhao, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi, Yang Gao

The core issue in semi-supervised learning (SSL) lies in how to effectively leverage unlabeled data, whereas most existing methods tend to put a great emphasis on the utilization of high-confidence samples yet seldom fully explore the usage of low-confidence samples.

Semi-Supervised Image Classification

Topological EEG Nonlinear Dynamics Analysis for Emotion Recognition

no code implementations14 Mar 2022 Yan Yan, Xuankun Wu, Chengdong Li, Yini He, Zhicheng Zhang, Huihui Li, Ang Li, Lei Wang

The proposed work is the first investigation in the emotion recognition oriented EEG topological feature analysis, which brought a novel insight into the brain neural system nonlinear dynamics analysis and feature extraction.

Arousal Estimation Dominance Estimation +7

Deep Transfer Learning with Graph Neural Network for Sensor-Based Human Activity Recognition

no code implementations14 Mar 2022 Yan Yan, Tianzheng Liao, Jinjin Zhao, Jiahong Wang, Liang Ma, Wei Lv, Jing Xiong, Lei Wang

Given this observation, we devised a graph-inspired deep learning approach toward the sensor-based HAR tasks, which was further used to build a deep transfer learning model toward giving a tentative solution for these two challenging problems.

Few-Shot Learning Human Activity Recognition +1

Two-stream Hierarchical Similarity Reasoning for Image-text Matching

no code implementations10 Mar 2022 Ran Chen, Hanli Wang, Lei Wang, Sam Kwong

Second, previous approaches only consider learning single-stream similarity alignment (i. e., image-to-text level or text-to-image level), which is inadequate to fully use similarity information for image-text matching.

Image-text matching Text Matching +1

CenGCN: Centralized Convolutional Networks with Vertex Imbalance for Scale-Free Graphs

no code implementations16 Feb 2022 Feng Xia, Lei Wang, Tao Tang, Xin Chen, Xiangjie Kong, Giles Oatley, Irwin King

In each non-output layer of the GCN, this framework uses a hub attention mechanism to assign new weights to connected non-hub vertices based on their common information with hub vertices.

Link Prediction

Active and Passive Hybrid Detection Method for Power CPS False Data Injection Attacks with Improved AKF and GRU-CNN

no code implementations14 Feb 2022 Zhaoyang Qu, Xiaoyong Bo, Tong Yu, Yaowei Liu, Yunchang Dong, Zhongfeng Kan, Lei Wang, Yang Li

Taking account of the fact that the existing knowledge-driven detection process for FDIAs has been in a passive detection state for a long time and ignores the advantages of data-driven active capture of features, an active and passive hybrid detection method for power CPS FDIAs with improved adaptive Kalman filter (AKF) and convolutional neural networks (CNN) is proposed in this paper.

AD-NEGF: An End-to-End Differentiable Quantum Transport Simulator for Sensitivity Analysis and Inverse Problems

no code implementations10 Feb 2022 Yingzhanghao Zhou, Xiang Chen, Peng Zhang, Jun Wang, Lei Wang, Hong Guo

Since proposed in the 70s, the Non-Equilibrium Green Function (NEGF) method has been recognized as a standard approach to quantum transport simulations.

A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification

no code implementations24 Jan 2022 Lei Qi, Lei Wang, Yinghuan Shi, Xin Geng

Different from the conventional data augmentation, the proposed domain-aware mix-normalization to enhance the diversity of features during training from the normalization view of the neural network, which can effectively alleviate the model overfitting to the source domains, so as to boost the generalization capability of the model in the unseen domain.

Data Augmentation Person Re-Identification

Indirect Adaptive Control of Nonlinearly Parameterized Nonlinear Dissipative Systems

no code implementations15 Jan 2022 Romeo Ortega, Rafael Cisneros, Lei Wang, Arjan van der Schaft

In this note we address the problem of indirect adaptive (regulation or tracking) control of nonlinear, input affine dissipative systems.

$m^\ast$ of two-dimensional electron gas: a neural canonical transformation study

1 code implementation10 Jan 2022 Hao Xie, Linfeng Zhang, Lei Wang

The quasiparticle effective mass $m^\ast$ of interacting electrons is a fundamental quantity in the Fermi liquid theory.

StyTr2: Image Style Transfer With Transformers

3 code implementations CVPR 2022 Yingying Deng, Fan Tang, WeiMing Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu

The goal of image style transfer is to render an image with artistic features guided by a style reference while maintaining the original content.

Style Transfer

Kernelized Few-Shot Object Detection With Efficient Integral Aggregation

no code implementations CVPR 2022 Shan Zhang, Lei Wang, Naila Murray, Piotr Koniusz

We design a Kernelized Few-shot Object Detector by leveraging kernelized matrices computed over multiple proposal regions, which yield expressive non-linear representations whose model complexity is learned on the fly.

Few-Shot Object Detection Object +2

Decentralized Optimization Over the Stiefel Manifold by an Approximate Augmented Lagrangian Function

no code implementations30 Dec 2021 Lei Wang, Xin Liu

In this paper, we focus on the decentralized optimization problem over the Stiefel manifold, which is defined on a connected network of $d$ agents.

Integrating Quantum Processor Device and Control Optimization in a Gradient-based Framework

no code implementations23 Dec 2021 Xiaotong Ni, Hui-Hai Zhao, Lei Wang, Feng Wu, Jianxin Chen

In a quantum processor, the device design and external controls together contribute to the quality of the target quantum operations.

3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve

no code implementations23 Dec 2021 Lei Wang, Jun Liu, Piotr Koniusz

In this paper, we propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE).

Dynamic Time Warping Few-Shot action recognition +3

A Multi-View Framework for BGP Anomaly Detection via Graph Attention Network

no code implementations23 Dec 2021 Songtao Peng, Jiaqi Nie, Xincheng Shu, Zhongyuan Ruan, Lei Wang, Yunxuan Sheng, Qi Xuan

As the default protocol for exchanging routing reachability information on the Internet, the abnormal behavior in traffic of Border Gateway Protocols (BGP) is closely related to Internet anomaly events.

Anomaly Detection feature selection +3

Analysis and Evaluation of Kinect-based Action Recognition Algorithms

1 code implementation16 Dec 2021 Lei Wang

Human action recognition still exists many challenging problems such as different viewpoints, occlusion, lighting conditions, human body size and the speed of action execution, although it has been widely used in different areas.

Action Recognition Temporal Action Localization

Unsupervised Domain Generalization for Person Re-identification: A Domain-specific Adaptive Framework

1 code implementation30 Nov 2021 Lei Qi, Jiaqi Liu, Lei Wang, Yinghuan Shi, Xin Geng

A significance of our work lies in that it shows the potential of unsupervised domain generalization for person ReID and sets a strong baseline for the further research on this topic.

Domain Generalization Person Re-Identification +1

Learning Dynamic Compact Memory Embedding for Deformable Visual Object Tracking

no code implementations23 Nov 2021 Pengfei Zhu, Hongtao Yu, Kaihua Zhang, Yu Wang, Shuai Zhao, Lei Wang, Tianzhu Zhang, QinGhua Hu

To address this issue, segmentation-based trackers have been proposed that employ per-pixel matching to improve the tracking performance of deformable objects effectively.

Segmentation Visual Object Tracking +1

Block-Sparse Recovery Network for Two-Dimensional Harmonic Retrieval

no code implementations15 Nov 2021 Rong Fu, Tianyao Huang, Lei Wang, Yimin Liu

As a typical signal processing problem, multidimensional harmonic retrieval (MHR) has been adapted to a wide range of applications in signal processing.

Retrieval Vocal Bursts Valence Prediction

A Novel Sample-efficient Deep Reinforcement Learning with Episodic Policy Transfer for PID-Based Control in Cardiac Catheterization Robots

no code implementations28 Oct 2021 Olatunji Mumini Omisore, Toluwanimi Akinyemi, Wenke Duan, Wenjing Du, Lei Wang

Robotic catheterization is typically used for percutaneous coronary intervention procedures nowadays and it involves steering flexible endovascular tools to open up occlusion in the coronaries.

R4: A Framework for Route Representation and Route Recommendation

no code implementations20 Oct 2021 Ran Cheng, Chao Chen, Longfei Xu, Shen Li, Lei Wang, Hengbin Cui, Kaikui Liu, Xiaolong Li

For user representation, we utilize a series of historical navigation to extract user preference.

Attribute

Graph Partner Neural Networks for Semi-Supervised Learning on Graphs

no code implementations18 Oct 2021 Langzhang Liang, Cuiyun Gao, Shiyi Chen, Shishi Duan, Yu Pan, Junjin Zheng, Lei Wang, Zenglin Xu

Graph Convolutional Networks (GCNs) are powerful for processing graph-structured data and have achieved state-of-the-art performance in several tasks such as node classification, link prediction, and graph classification.

Graph Classification Link Prediction +1

High-order Tensor Pooling with Attention for Action Recognition

no code implementations11 Oct 2021 Lei Wang, Ke Sun, Piotr Koniusz

We aim at capturing high-order statistics of feature vectors formed by a neural network, and propose end-to-end second- and higher-order pooling to form a tensor descriptor.

Ranked #2 on Scene Recognition on YUP++ (using extra training data)

Action Recognition Scene Recognition +1

Differential Privacy with Manifold Data Dependency

no code implementations29 Sep 2021 Lei Wang, Deming Yuan, Guodong Shi

In this paper, we study dataset processing mechanisms generated by linear queries in the presence of manifold data dependency.

Network Learning in Quadratic Games from Fictitious Plays

no code implementations29 Sep 2021 Kemi Ding, Yijun Chen, Lei Wang, Xiaoqiang Ren, Guodong Shi

Next, in view of the inherent stability and sparsity constraints for the network interaction structure, we propose a stable and sparse system identification framework for learning the interaction graph from full player action observations.

Distributed Zeroth-Order Optimization: Convergence Rates That Match Centralized Counterpart

no code implementations29 Sep 2021 Deming Yuan, Lei Wang, Alexandre Proutiere, Guodong Shi

Zeroth-order optimization has become increasingly important in complex optimization and machine learning when cost functions are impossible to be described in closed analytical forms.

Heterologous Normalization

no code implementations29 Sep 2021 Chunjie Luo, Jianfeng Zhan, Lei Wang, Wanling Gao

Specifically, it calculates the mean like Batch Normalization to maintain the advantage of Batch Normalization.

A hierarchical residual network with compact triplet-center loss for sketch recognition

no code implementations28 Sep 2021 Lei Wang, Shihui Zhang, Huan He, Xiaoxiao Zhang, Yu Sang

Last but not least, the compact triplet-center loss is proposed specifically for the sketch recognition task.

Sketch Recognition

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

1 code implementation Findings (EMNLP) 2021 Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim

While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects.

Graph Question Answering Question Answering

Progressive Hard-case Mining across Pyramid Levels for Object Detection

1 code implementation15 Sep 2021 Binghong Wu, Yehui Yang, Dalu Yang, Junde Wu, Xiaorong Wang, Haifeng Huang, Lei Wang, Yanwu Xu

Based on focal loss with ATSS-R50, our approach achieves 40. 5 AP, surpassing the state-of-the-art QFL (Quality Focal Loss, 39. 9 AP) and VFL (Varifocal Loss, 40. 1 AP).

object-detection Object Detection

LibFewShot: A Comprehensive Library for Few-shot Learning

1 code implementation10 Sep 2021 Wenbin Li, Ziyi, Wang, Xuesong Yang, Chuanqi Dong, Pinzhuo Tian, Tiexin Qin, Jing Huo, Yinghuan Shi, Lei Wang, Yang Gao, Jiebo Luo

Furthermore, based on LibFewShot, we provide comprehensive evaluations on multiple benchmarks with various backbone architectures to evaluate common pitfalls and effects of different training tricks.

Data Augmentation Few-Shot Image Classification +2

Improving Ranking Correlation of Supernet with Candidates Enhancement and Progressive Training

1 code implementation12 Aug 2021 Ziwei Yang, Ruyi Zhang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

One-shot neural architecture search (NAS) applies weight-sharing supernet to reduce the unaffordable computation overhead of automated architecture designing.

Neural Architecture Search

Cascade Bagging for Accuracy Prediction with Few Training Samples

1 code implementation12 Aug 2021 Ruyi Zhang, Ziwei Yang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

To alleviate this problem, we propose a novel framework to train an accuracy predictor under few training samples.

Data Augmentation Ensemble Learning +1

Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse Similarity Encoding

no code implementations6 Aug 2021 Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang

Inspired by the recent local descriptor based few-shot learning (FSL), our general UDA model is fully built upon local descriptors (LDs) for image classification and domain adaptation.

Few-Shot Learning Image Classification +1

Trade When Opportunity Comes: Price Movement Forecasting via Locality-Aware Attention and Iterative Refinement Labeling

no code implementations26 Jul 2021 Liang Zeng, Lei Wang, Hui Niu, Ruchen Zhang, Ling Wang, Jian Li

In a set of experiments on three real-world financial markets: stocks, cryptocurrencies, and ETFs, LARA significantly outperforms several machine learning based methods on the Qlib quantitative investment platform.

Metric Learning Time Series Analysis

Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

1 code implementation24 Jul 2021 Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi, Wuzhang Wang, Yang Gao

Together, the above two schemes give rise to a novel double-branch encoder segmentation framework for medical image segmentation, namely Crosslink-Net.

Image Segmentation Medical Image Segmentation +2

Hand Image Understanding via Deep Multi-Task Learning

1 code implementation ICCV 2021 Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu

To further improve the performance of these tasks, we propose a novel Hand Image Understanding (HIU) framework to extract comprehensive information of the hand object from a single RGB image, by jointly considering the relationships between these tasks.

3D Hand Pose Estimation Multi-Task Learning +1

Trip-ROMA: Self-Supervised Learning with Triplets and Random Mappings

1 code implementation22 Jul 2021 Wenbin Li, Xuesong Yang, Meihao Kong, Lei Wang, Jing Huo, Yang Gao, Jiebo Luo

However, in small data regimes, we can not obtain a sufficient number of negative pairs or effectively avoid the over-fitting problem when negatives are not used at all.

Representation Learning Self-Supervised Learning +1

A Self-Boosting Framework for Automated Radiographic Report Generation

no code implementations CVPR 2021 Zhanyu Wang, Luping Zhou, Lei Wang, Xiu Li

On one hand, the image-text matching branch helps to learn highly text-correlated visual features for the report generation branch to output high quality reports.

Image Captioning Image-text matching +3

Signal Acquisition of Luojia-1A Low Earth Orbit Navigation Augmentation System with Software Defined Receiver

no code implementations31 May 2021 Liang Chen, Xiangchen Lu, Nan Shen, Lei Wang, Yuan Zhuang, Ye Su, Deren Li, Ruizhi Chen

The performance of those integration algorithms on expanding the successful acquisition time range is verified by the real data collected from the Luojia-1A satellite.

StyTr$^2$: Image Style Transfer with Transformers

4 code implementations30 May 2021 Yingying Deng, Fan Tang, WeiMing Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu

The goal of image style transfer is to render an image with artistic features guided by a style reference while maintaining the original content.

Style Transfer

Hybrid gene selection approach using XGBoost and multi-objective genetic algorithm for cancer classification

no code implementations30 May 2021 Xiongshi Deng, Min Li, Shaobo Deng, Lei Wang

In the second stage, XGBoost-MOGA searches for an optimal gene subset based on the most relevant genes's group using a multi-objective optimization genetic algorithm.

feature selection

RFCBF: enhance the performance and stability of Fast Correlation-Based Filter

no code implementations30 May 2021 Xiongshi Deng, Min Li, Lei Wang, Qikang Wan

Feature selection is a preprocessing step which plays a crucial role in the domain of machine learning and data mining.

feature selection

Investigating Math Word Problems using Pretrained Multilingual Language Models

1 code implementation19 May 2021 Minghuan Tan, Lei Wang, Lingxiao Jiang, Jing Jiang

In this paper, we revisit math word problems~(MWPs) from the cross-lingual and multilingual perspective.

Machine Translation Math +2

Ab-initio study of interacting fermions at finite temperature with neural canonical transformation

1 code implementation18 May 2021 Hao Xie, Linfeng Zhang, Lei Wang

The variational density matrix is parametrized by a permutation equivariant many-body unitary transformation together with a discrete probabilistic model.

Fusing Higher-order Features in Graph Neural Networks for Skeleton-based Action Recognition

1 code implementation4 May 2021 Zhenyue Qin, Yang Liu, Pan Ji, Dongwoo Kim, Lei Wang, Bob McKay, Saeed Anwar, Tom Gedeon

Recent skeleton-based action recognition methods extract features from 3D joint coordinates as spatial-temporal cues, using these representations in a graph neural network for feature fusion to boost recognition performance.

Action Recognition Skeleton Based Action Recognition

Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification

no code implementations CVPR 2021 Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang

Learning discriminative image representations plays a vital role in long-tailed image classification because it can ease the classifier learning in imbalanced cases.

Classification Contrastive Learning +4

Shift-and-Balance Attention

1 code implementation24 Mar 2021 Chunjie Luo, Jianfeng Zhan, Tianshu Hao, Lei Wang, Wanling Gao

The attention branch is gated using the Sigmoid function and multiplied by the feature map's trunk branch.

DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

no code implementations14 Mar 2021 Zhiqiang Hu, Roy Ka-Wei Lee, Lei Wang, Ee-Peng Lim, Bo Dai

Authorship attribution (AA), which is the task of finding the owner of a given text, is an important and widely studied research topic with many applications.

text-classification Text Classification

Network Representation Learning: From Traditional Feature Learning to Deep Learning

no code implementations7 Mar 2021 Ke Sun, Lei Wang, Bo Xu, Wenhong Zhao, Shyh Wei Teng, Feng Xia

Network representation learning (NRL) is an effective graph analytics technique and promotes users to deeply understand the hidden characteristics of graph data.

Recommendation Systems Representation Learning

Coordinated Cyber-Attack Detection Model of Cyber-Physical Power System Based on the Operating State Data Link

no code implementations27 Feb 2021 Lei Wang, Pengcheng Xu, Zhaoyang Qu, Xiaoyong Bo, Yunchang Dong, Zhenming Zhang, Yang Li

Existing coordinated cyber-attack detection methods have low detection accuracy and efficiency and poor generalization ability due to difficulties dealing with unbalanced attack data samples, high data dimensionality, and noisy data sets.

Cyber Attack Detection

HPC AI500: Representative, Repeatable and Simple HPC AI Benchmarking

no code implementations25 Feb 2021 Zihan Jiang, Wanling Gao, Fei Tang, Xingwang Xiong, Lei Wang, Chuanxin Lan, Chunjie Luo, Hongxiao Li, Jianfeng Zhan

Recent years witness a trend of applying large-scale distributed deep learning algorithms (HPC AI) in both business and scientific computing areas, whose goal is to speed up the training time to achieve a state-of-the-art quality.

Image Classification Performance

Modeling Method for the Coupling Relations of Microgrid Cyber-Physical Systems Driven by Hybrid Spatiotemporal Events

no code implementations1 Feb 2021 Xiaoyong Bo, Xiaoyu Chen, Huashun Li, Yunchang Dong, Zhaoyang Qu, Lei Wang, Yang Li

Considering the constraints of the temporal conversion of information flow and energy flow, a microgrid CPS coupling model is established, the effectiveness of which is verified by simulating false data injection attack (FDIA) scenarios.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.