Search Results for author: Lei Wang

Found 364 papers, 124 papers with code

ASCM: An Answer Space Clustered Prompting Method without Answer Engineering

1 code implementation • Findings (ACL) 2022 • Zhen Wang, Yating Yang, Zhou Xi, Bo Ma, Lei Wang, Rui Dong, Azmat Anwar

We also propose a stable semi-supervised method named stair learning (SL) that orderly distills knowledge from better models to weaker models.

Few-Shot Text Classification Natural Language Inference +1

Paper
Code

ReDro: Efficiently Learning Large-sized SPD Visual Representation

no code implementations • ECCV 2020 • Saimunur Rahman, Lei Wang, Changming Sun, Luping Zhou

When learning this representation in deep networks, eigen-decomposition of covariance matrix is usually needed for a key step called matrix normalisation.

Fine-Grained Image Classification

Paper
Add Code

基于预训练语言模型的案件要素识别方法(A Method for Case Factor Recognition Based on Pre-trained Language Models)

no code implementations • CCL 2020 • Haishun Liu, Lei Wang, Yanguang Chen, Shuchen Zhang, Yuanyuan Sun, Hongfei Lin

案件要素识别指将案件描述中重要事实描述自动抽取出来, 并根据领域专家设计的要素体系进行分类, 是智慧司法领域的重要研究内容。基于传统神经网络的文本编码难以提取深层次特征, 基于阈值的多标签分类难以捕获标签间依赖关系, 因此本文提出了基于预训练语言模型的多标签文本分类模型。该模型采用以Layer-attentive策略进行特征融合的语言模型作为编码器, 使用基于LSTM的序列生成模型作为解码器。在“CAIL2019”数据集上进行实验, 该方法比基于循环神经网络的算法在F1值上最高可提升7. 6%, 在相同超参数设置下比基础语言模型(BERT)提升约3. 2%。

Paper
Add Code

RotateCT: Knowledge Graph Embedding by Rotation and Coordinate Transformation in Complex Space

no code implementations • COLING 2022 • Yao Dong, Lei Wang, Ji Xiang, Xiaobo Guo, Yuqiang Xie

Knowledge graph embedding, which aims to learn representations of entities and relations in knowledge graphs, finds applications in various downstream tasks.

Computational Efficiency Knowledge Graph Embedding +3

Paper
Add Code

Nonlinear sparse variational Bayesian learning based model predictive control with application to PEMFC temperature control

no code implementations • 15 Apr 2024 • Qi Zhang, Lei Wang, Weihua Xu, Hongye Su, Lei Xie

Variational inference is used by NSVB-MPC to assess the predictive accuracy and make the necessary corrections to quantify system uncertainty.

Model Predictive Control Variational Inference

Paper
Add Code

Weakly-Supervised Learning via Multi-Lateral Decoder Branching for Guidewire Segmentation in Robot-Assisted Cardiovascular Catheterization

no code implementations • 11 Apr 2024 • Olatunji Mumini Omisore, Toluwanimi Akinyemi, Anh Nguyen, Lei Wang

Thus, we offer a less expensive method for real-time tool segmentation and tracking during robot-assisted cardiac catheterization.

Segmentation Weakly-supervised Learning

Paper
Add Code

ONNXPruner: ONNX-Based General Model Pruning Adapter

no code implementations • 10 Apr 2024 • Dongdong Ren, Wenbin Li, Tianyu Ding, Lei Wang, Qi Fan, Jing Huo, Hongbing Pan, Yang Gao

However, the practical application of these algorithms across various models and platforms remains a significant challenge.

Paper
Add Code

AUEditNet: Dual-Branch Facial Action Unit Intensity Manipulation with Implicit Disentanglement

no code implementations • 7 Apr 2024 • Shiwei Jin, Zhen Wang, Lei Wang, Peng Liu, Ning Bi, Truong Nguyen

Our experiments demonstrate AUEditNet's superior accuracy in editing AU intensities, affirming its capability in disentangling facial attributes and identity within a limited subject pool.

Attribute Disentanglement

Paper
Add Code

CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

no code implementations • 2 Apr 2024 • Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian, Lei Wang, Wei Wang, Jingtian Xian, Youhua Xu, Tianmeng Zhang, Xin Zhang, Zhimin Zhou

To overcome these challenges, we have developed a framework based on a hierarchical visual Transformer with a sliding window technique to search for strong lensing systems within entire images.

Paper
Add Code

HyperCLOVA X Technical Report

no code implementations • 2 Apr 2024 • Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, Youngkyun Jin, Hyein Jun, Jaeseung Jung, Chanwoong Kim, jinhong Kim, Jinuk Kim, Dokyeong Lee, Dongwook Park, Jeong Min Sohn, Sujung Han, Jiae Heo, Sungju Hong, Mina Jeon, Hyunhoon Jung, Jungeun Jung, Wangkyo Jung, Chungjoon Kim, Hyeri Kim, Jonghyun Kim, Min Young Kim, Soeun Lee, Joonhee Park, Jieun Shin, Sojin Yang, Jungsoon Yoon, Hwaran Lee, Sanghwan Bae, Jeehwan Cha, Karl Gylleus, Donghoon Ham, Mihak Hong, Youngki Hong, Yunki Hong, Dahyun Jang, Hyojun Jeon, Yujin Jeon, Yeji Jeong, Myunggeun Ji, Yeguk Jin, Chansong Jo, Shinyoung Joo, Seunghwan Jung, Adrian Jungmyung Kim, Byoung Hoon Kim, Hyomin Kim, Jungwhan Kim, Minkyoung Kim, Minseung Kim, Sungdong Kim, Yonghee Kim, Youngjun Kim, Youngkwan Kim, Donghyeon Ko, Dughyun Lee, Ha Young Lee, Jaehong Lee, Jieun Lee, Jonghyun Lee, Jongjin Lee, Min Young Lee, Yehbin Lee, Taehong Min, Yuri Min, Kiyoon Moon, Hyangnam Oh, Jaesun Park, Kyuyon Park, Younghun Park, Hanbae Seo, Seunghyun Seo, Mihyun Sim, Gyubin Son, Matt Yeo, Kyung Hoon Yeom, Wonjoon Yoo, Myungin You, Doheon Ahn, Homin Ahn, Joohee Ahn, Seongmin Ahn, Chanwoo An, Hyeryun An, Junho An, Sang-Min An, Boram Byun, Eunbin Byun, Jongho Cha, Minji Chang, Seunggyu Chang, Haesong Cho, Youngdo Cho, Dalnim Choi, Daseul Choi, Hyoseok Choi, Minseong Choi, Sangho Choi, Seongjae Choi, Wooyong Choi, Sewhan Chun, Dong Young Go, Chiheon Ham, Danbi Han, Jaemin Han, Moonyoung Hong, Sung Bum Hong, Dong-Hyun Hwang, Seongchan Hwang, Jinbae Im, Hyuk Jin Jang, Jaehyung Jang, Jaeni Jang, Sihyeon Jang, Sungwon Jang, Joonha Jeon, Daun Jeong, JoonHyun Jeong, Kyeongseok Jeong, Mini Jeong, Sol Jin, Hanbyeol Jo, Hanju Jo, Minjung Jo, Chaeyoon Jung, Hyungsik Jung, Jaeuk Jung, Ju Hwan Jung, Kwangsun Jung, Seungjae Jung, Soonwon Ka, Donghan Kang, Soyoung Kang, Taeho Kil, Areum Kim, Beomyoung Kim, Byeongwook Kim, Daehee Kim, Dong-Gyun Kim, Donggook Kim, Donghyun Kim, Euna Kim, Eunchul Kim, Geewook Kim, Gyu Ri Kim, Hanbyul Kim, Heesu Kim, Isaac Kim, Jeonghoon Kim, JiHye Kim, Joonghoon Kim, Minjae Kim, Minsub Kim, Pil Hwan Kim, Sammy Kim, Seokhun Kim, Seonghyeon Kim, Soojin Kim, Soong Kim, Soyoon Kim, Sunyoung Kim, TaeHo Kim, Wonho Kim, Yoonsik Kim, You Jin Kim, Yuri Kim, Beomseok Kwon, Ohsung Kwon, Yoo-Hwan Kwon, Anna Lee, Byungwook Lee, Changho Lee, Daun Lee, Dongjae Lee, Ha-Ram Lee, Hodong Lee, Hwiyeong Lee, Hyunmi Lee, Injae Lee, Jaeung Lee, Jeongsang Lee, Jisoo Lee, JongSoo Lee, Joongjae Lee, Juhan Lee, Jung Hyun Lee, Junghoon Lee, Junwoo Lee, Se Yun Lee, Sujin Lee, Sungjae Lee, Sungwoo Lee, Wonjae Lee, Zoo Hyun Lee, Jong Kun Lim, Kun Lim, Taemin Lim, Nuri Na, Jeongyeon Nam, Kyeong-Min Nam, Yeonseog Noh, Biro Oh, Jung-Sik Oh, Solgil Oh, Yeontaek Oh, Boyoun Park, Cheonbok Park, Dongju Park, Hyeonjin Park, Hyun Tae Park, Hyunjung Park, JiHye Park, Jooseok Park, JungHwan Park, Jungsoo Park, Miru Park, Sang Hee Park, Seunghyun Park, Soyoung Park, Taerim Park, Wonkyeong Park, Hyunjoon Ryu, Jeonghun Ryu, Nahyeon Ryu, Soonshin Seo, Suk Min Seo, Yoonjeong Shim, Kyuyong Shin, Wonkwang Shin, Hyun Sim, Woongseob Sim, Hyejin Soh, Bokyong Son, Hyunjun Son, Seulah Son, Chi-Yun Song, Chiyoung Song, Ka Yeon Song, Minchul Song, Seungmin Song, Jisung Wang, Yonggoo Yeo, Myeong Yeon Yi, Moon Bin Yim, Taehwan Yoo, Youngjoon Yoo, Sungmin Yoon, Young Jin Yoon, Hangyeol Yu, Ui Seon Yu, Xingdong Zuo, Jeongin Bae, Joungeun Bae, Hyunsoo Cho, Seonghyun Cho, Yongjin Cho, Taekyoon Choi, Yera Choi, Jiwan Chung, Zhenghui Han, Byeongho Heo, Euisuk Hong, Taebaek Hwang, Seonyeol Im, Sumin Jegal, Sumin Jeon, Yelim Jeong, Yonghyun Jeong, Can Jiang, Juyong Jiang, Jiho Jin, Ara Jo, Younghyun Jo, Hoyoun Jung, Juyoung Jung, Seunghyeong Kang, Dae Hee Kim, Ginam Kim, Hangyeol Kim, Heeseung Kim, Hyojin Kim, Hyojun Kim, Hyun-Ah Kim, Jeehye Kim, Jin-Hwa Kim, Jiseon Kim, Jonghak Kim, Jung Yoon Kim, Rak Yeong Kim, Seongjin Kim, Seoyoon Kim, Sewon Kim, Sooyoung Kim, Sukyoung Kim, Taeyong Kim, Naeun Ko, Bonseung Koo, Heeyoung Kwak, Haena Kwon, Youngjin Kwon, Boram Lee, Bruce W. Lee, Dagyeong Lee, Erin Lee, Euijin Lee, Ha Gyeong Lee, Hyojin Lee, Hyunjeong Lee, Jeeyoon Lee, Jeonghyun Lee, Jongheok Lee, Joonhyung Lee, Junhyuk Lee, Mingu Lee, Nayeon Lee, Sangkyu Lee, Se Young Lee, Seulgi Lee, Seung Jin Lee, Suhyeon Lee, Yeonjae Lee, Yesol Lee, Youngbeom Lee, Yujin Lee, Shaodong Li, Tianyu Liu, Seong-Eun Moon, Taehong Moon, Max-Lasse Nihlenramstroem, Wonseok Oh, Yuri Oh, Hongbeen Park, Hyekyung Park, Jaeho Park, Nohil Park, Sangjin Park, Jiwon Ryu, Miru Ryu, Simo Ryu, Ahreum Seo, Hee Seo, Kangdeok Seo, Jamin Shin, Seungyoun Shin, Heetae Sin, Jiangping Wang, Lei Wang, Ning Xiang, Longxiang Xiao, Jing Xu, Seonyeong Yi, Haanju Yoo, Haneul Yoo, Hwanhee Yoo, Liang Yu, Youngjae Yu, Weijie Yuan, Bo Zeng, Qian Zhou, Kyunghyun Cho, Jung-Woo Ha, Joonsuk Park, Jihyun Hwang, Hyoung Jo Kwon, Soonyong Kwon, Jungyeon Lee, Seungho Lee, Seonghyeon Lim, Hyunkyung Noh, Seungho Choi, Sang-Woo Lee, Jung Hwa Lim, Nako Sung

We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding.

Instruction Following Machine Translation +1

Paper
Add Code

Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation

1 code implementation • 31 Mar 2024 • Wenxiao Deng, Wenbin Li, Tianyu Ding, Lei Wang, Hongguang Zhang, Kuihua Huang, Jing Huo, Yang Gao

However, these methods face two primary limitations: the dispersed feature distribution within the same class in synthetic datasets, reducing class discrimination, and an exclusive focus on mean feature consistency, lacking precision and comprehensiveness.

Paper
Code

Computation and Communication Efficient Lightweighting Vertical Federated Learning

1 code implementation • 30 Mar 2024 • Heqiang Wang, Jieming Bian, Lei Wang

Moreover, we establish a convergence bound for our LVFL algorithm, which accounts for both communication and computational lightweighting ratios.

Computational Efficiency Image Classification +1

Paper
Code

PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

1 code implementation • 26 Mar 2024 • Jinyi Li, Yihuai Lan, Lei Wang, Hao Wang

Prompt compression is an innovative method for efficiently condensing input prompts while preserving essential information.

Code Completion Few-Shot Learning +2

130

Paper
Code

Space Group Informed Transformer for Crystalline Materials Generation

1 code implementation • 23 Mar 2024 • Zhendong Cao, Xiaoshan Luo, Jian Lv, Lei Wang

We introduce CrystalFormer, a transformer-based autoregressive model specifically designed for space group-controlled generation of crystalline materials.

Paper
Code

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

2 code implementations • 21 Mar 2024 • Quan Zhang, Lei Wang, Vishal M. Patel, Xiaohua Xie, JianHuang Lai

Experiments on two datasets show that VDT is a feasible and effective solution for AGPReID, surpassing the previous method on mAP/Rank1 by up to 5. 0%/2. 7% on CARGO and 3. 7%/5. 2% on AG-ReID, keeping the same magnitude of computational complexity.

Person Re-Identification

131

Paper
Code

The Whole is Better than the Sum: Using Aggregated Demonstrations in In-Context Learning for Sequential Recommendation

1 code implementation • 15 Mar 2024 • Lei Wang, Ee-Peng Lim

Large language models (LLMs) have shown excellent performance on various NLP tasks.

In-Context Learning Sequential Recommendation

Paper
Code

Chaotic Masking Protocol for Secure Communication and Attack Detection in Remote Estimation of Cyber-Physical Systems

no code implementations • 14 Mar 2024 • Tao Chen, Andreu Cecilia, Daniele Astolfi, Lei Wang, Zhitao Liu, Hongye Su

In remote estimation of cyber-physical systems (CPSs), sensor measurements transmitted through network may be attacked by adversaries, leading to leakage risk of privacy (e. g., the system state), and/or failure of the remote estimator.

Paper
Add Code

Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains

no code implementations • 14 Mar 2024 • Lei Wang, Jieming Bian, Letian Zhang, Chen Chen, Jie Xu

Federated learning (FL) allows collaborative machine learning training without sharing private data.

Clustering Federated Learning

Paper
Add Code

Adaptive Hybrid Masking Strategy for Privacy-Preserving Face Recognition Against Model Inversion Attack

no code implementations • 14 Mar 2024 • Yuanqing Huang, Yinggui Wang, Jianshu Li, Le Yang, Kai Song, Lei Wang

Specifically, face images are masked in the frequency domain using an adaptive MixUp strategy.

Data Augmentation Face Recognition +1

Paper
Add Code

Gradient-Aware Logit Adjustment Loss for Long-tailed Classifier

1 code implementation • 14 Mar 2024 • Fan Zhang, Wei Qin, Weijieying Ren, Lei Wang, Zetong Chen, Richang Hong

Additionally, We find that most of the solutions to long-tailed problems are still biased towards head classes in the end, and we propose a simple and post hoc prediction re-balancing strategy to further mitigate the basis toward head class.

Paper
Code

Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning

1 code implementation • 29 Feb 2024 • Weijieying Ren, Xinlong Li, Lei Wang, Tianxiang Zhao, Wei Qin

Through extensive experiments, we uncover the mode connectivity phenomenon in the LLMs continual learning scenario and find that it can strike a balance between plasticity and stability.

Continual Learning Language Modelling +1

Paper
Code

Feynman Diagrams as Computational Graphs

no code implementations • 28 Feb 2024 • Pengcheng Hou, Tao Wang, Daniel Cerkoney, Xiansheng Cai, Zhiyi Li, Youjin Deng, Lei Wang, Kun Chen

We propose a computational graph representation of high-order Feynman diagrams in Quantum Field Theory (QFT), applicable to any combination of spatial, temporal, momentum, and frequency domains.

Paper
Add Code

All in an Aggregated Image for In-Image Learning

1 code implementation • 28 Feb 2024 • Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim

This paper introduces a new in-context learning (ICL) mechanism called In-Image Learning (I$^2$L) that combines demonstration examples, visual cues, and chain-of-thought reasoning into an aggregated image to enhance the capabilities of Large Multimodal Models (e. g., GPT-4V) in multimodal reasoning tasks.

Hallucination In-Context Learning +1

Paper
Code

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

1 code implementation • 27 Feb 2024 • Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong Xue, Furu Wei

Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs).

Paper
Code

Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating

no code implementations • 21 Feb 2024 • Yifan Yanggong, Hao Pan, Lei Wang

Games are a simplified model of reality and often serve as a favored platform for Artificial Intelligence (AI) research.

Decision Making

Paper
Add Code

Advancing Anomaly Detection: An Adaptation Model and a New Dataset

no code implementations • 7 Feb 2024 • Liyun Zhu, Arjun Raj, Lei Wang

To address these challenges, we propose the Scenario-Adaptive Anomaly Detection (SA2D) method, leveraging the few-shot learning framework for faster adaptation of pre-trained models to new concepts.

Anomaly Detection Few-Shot Learning

Paper
Add Code

Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment

no code implementations • 7 Feb 2024 • Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon, Piotr Koniusz

For a support sequence, we match it with view-simulated query sequences, as in the popular Dynamic Time Warping (DTW).

Dynamic Time Warping Few-Shot action recognition +2

Paper
Add Code

Taylor Videos for Action Recognition

1 code implementation • 5 Feb 2024 • Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng

Addressing these challenges, we propose the Taylor video, a new video format that highlights the dominate motions (e. g., a waving hand) in each of its frames named the Taylor frame.

Action Recognition Optical Flow Estimation

Paper
Code

Localization of Dummy Data Injection Attacks in Power Systems Considering Incomplete Topological Information: A Spatio-Temporal Graph Wavelet Convolutional Neural Network Approach

no code implementations • 27 Jan 2024 • Zhaoyang Qu, Yunchang Dong, Yang Li, Siqi Song, Tao Jiang, Min Li, Qiming Wang, Lei Wang, Xiaoyong Bo, Jiye Zang, Qi Xu

Unfortunately, this approach tends to overlook the inherent topological correlations within the non-Euclidean spatial attributes of power grid data, consequently leading to diminished accuracy in attack localization.

Paper
Add Code

Inference Attacks Against Face Recognition Model without Classification Layers

no code implementations • 24 Jan 2024 • Yuanqing Huang, Huilong Chen, Yinggui Wang, Lei Wang

To the best of our knowledge, the proposed attack model is the very first in the literature developed for FR models without a classification layer.

Face Recognition Generative Adversarial Network +3

Paper
Add Code

A Fast, Performant, Secure Distributed Training Framework For Large Language Model

no code implementations • 18 Jan 2024 • Wei Huang, Yinggui Wang, Anda Cheng, Aihui Zhou, Chaofan Yu, Lei Wang

In this paper, we propose a secure distributed LLM based on model slicing.

Language Modelling Large Language Model

Paper
Add Code

A Hypernetwork Based Framework for Non-Stationary Channel Prediction

no code implementations • 16 Jan 2024 • Guanzhang Liu, Zhengyang Hu, Lei Wang, Hongying Zhang, Jiang Xue, Michail Matthaiou

In this paper, a hypernetwork based framework is proposed for non-stationary channel prediction.

Paper
Add Code

Distributed Solvers for Network Linear Equations with Scalarized Compression

no code implementations • 12 Jan 2024 • Lei Wang, Zihao Ren, Deming Yuan, Guodong Shi

We then employ such a compressed consensus flow as a fundamental consensus subroutine to develop distributed continuous-time and discrete-time solvers for network linear equations, and prove their exponential convergence properties under scalar node communications.

Distributed Computing

Paper
Add Code

Quartet Logic: A Four-Step Reasoning (QLFR) framework for advancing Short Text Classification

no code implementations • 6 Jan 2024 • Hui Wu, Yuanben Zhang, Zhonghe Han, Yingyan Hou, Lei Wang, Siye liu, Qihang Gong, Yunping Ge

Consequently, this study sought to employ CoT to investigate the capabilities of LLMs in STC tasks.

Common Sense Reasoning Multi-Task Learning +2

Paper
Add Code

A Two-stage Personalized Virtual Try-on Framework with Shape Control and Texture Guidance

no code implementations • 24 Dec 2023 • Shufang Zhang, Minxue Ni, Lei Wang, Wenxin Ding, Shuai Chen, Yuhong Liu

The Diffusion model has a strong ability to generate wild images.

Virtual Try-on

Paper
Add Code

YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction

1 code implementation • 24 Dec 2023 • Xinglin Xiao, Yijie Wang, Nan Xu, Yuqi Wang, Hanxuan Yang, Minzheng Wang, Yin Luo, Lei Wang, Wenji Mao, Daniel Zeng

The difficulty of the information extraction task lies in dealing with the task-specific label schemas and heterogeneous data structures.

UIE

Paper
Code

YAYI 2: Multilingual Open-Source Large Language Models

no code implementations • 22 Dec 2023 • Yin Luo, Qingchao Kong, Nan Xu, Jia Cao, Bao Hao, Baoyu Qu, Bo Chen, Chao Zhu, Chenyang Zhao, Donglei Zhang, Fan Feng, Feifei Zhao, Hailong Sun, Hanxuan Yang, Haojun Pan, Hongyu Liu, Jianbin Guo, Jiangtao Du, Jingyi Wang, Junfeng Li, Lei Sun, Liduo Liu, Lifeng Dong, Lili Liu, Lin Wang, Liwen Zhang, Minzheng Wang, Pin Wang, Ping Yu, Qingxiao Li, Rui Yan, Rui Zou, Ruiqun Li, Taiwen Huang, Xiaodong Wang, Xiaofei Wu, Xin Peng, Xina Zhang, Xing Fang, Xinglin Xiao, Yanni Hao, Yao Dong, Yigang Wang, Ying Liu, Yongyu Jiang, Yungan Wang, Yuqi Wang, Zhangsheng Wang, Zhaoxin Yu, Zhen Luo, Wenji Mao, Lei Wang, Dajun Zeng

As the latest advancements in natural language processing, large language models (LLMs) have achieved human-level language understanding and generation abilities in many real-world tasks, and even have been regarded as a potential path to the artificial general intelligence.

Paper
Add Code

Gemini: A Family of Highly Capable Multimodal Models

no code implementations • The Keyword 2023 • Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, Fabio Viola, Malcolm Reynolds, Yuanzhong Xu, Ryan Doherty, Eli Collins, Clemens Meyer, Eliza Rutherford, Erica Moreira, Kareem Ayoub, Megha Goel, Jack Krawczyk, Ed Chi, Heng-Tze Cheng, Eric Ni, Purvi Shah, Patrick Kane, Betty Chan, Manaal Faruqui, Aliaksei Severyn, Hanzhao Lin, Yaguang Li, Yong Cheng, Mahdis Mahdieh, Mia Chen, Pei Sun, Dustin Tran, Sumit Bagri, Balaji Lakshminarayanan, Jeremiah Liu, Andras Orban, Fabian Güra, Hao Zhou, Xinying Song, Aurelien Boffy, Harish Ganapathy, Steven Zheng, HyunJeong Choe, Ágoston Weisz, Tao Zhu, Yifeng Lu, Siddharth Gopal, Jarrod Kahn, Maciej Kula, Jeff Pitman, Rushin Shah, Emanuel Taropa, Majd Al Merey, Martin Baeuml, Zhifeng Chen, Laurent El Shafey, Yujing Zhang, Olcan Sercinoglu, George Tucker, Enrique Piqueras, Maxim Krikun, Iain Barr, Nikolay Savinov, Ivo Danihelka, Becca Roelofs, Anaïs White, Anders Andreassen, Tamara von Glehn, Lakshman Yagati, Mehran Kazemi, Lucas Gonzalez, Misha Khalman, Jakub Sygnowski, Alexandre Frechette, Charlotte Smith, Laura Culp, Lev Proleev, Yi Luan, Xi Chen, James Lottes, Nathan Schucher, Federico Lebron, Alban Rrustemi, Natalie Clay, Phil Crone, Tomas Kocisky, Jeffrey Zhao, Bartek Perz, Dian Yu, Heidi Howard, Adam Bloniarz, Jack W. Rae, Han Lu, Laurent SIfre, Marcello Maggioni, Fred Alcober, Dan Garrette, Megan Barnes, Shantanu Thakoor, Jacob Austin, Gabriel Barth-Maron, William Wong, Rishabh Joshi, Rahma Chaabouni, Deeni Fatiha, Arun Ahuja, Gaurav Singh Tomar, Evan Senter, Martin Chadwick, Ilya Kornakov, Nithya Attaluri, Iñaki Iturrate, Ruibo Liu, Yunxuan Li, Sarah Cogan, Jeremy Chen, Chao Jia, Chenjie Gu, Qiao Zhang, Jordan Grimstad, Ale Jakse Hartman, Xavier Garcia, Thanumalayan Sankaranarayana Pillai, Jacob Devlin, Michael Laskin, Diego de Las Casas, Dasha Valter, Connie Tao, Lorenzo Blanco, Adrià Puigdomènech Badia, David Reitter, Mianna Chen, Jenny Brennan, Clara Rivera, Sergey Brin, Shariq Iqbal, Gabriela Surita, Jane Labanowski, Abhi Rao, Stephanie Winkler, Emilio Parisotto, Yiming Gu, Kate Olszewska, Ravi Addanki, Antoine Miech, Annie Louis, Denis Teplyashin, Geoff Brown, Elliot Catt, Jan Balaguer, Jackie Xiang, Pidong Wang, Zoe Ashwood, Anton Briukhov, Albert Webson, Sanjay Ganapathy, Smit Sanghavi, Ajay Kannan, Ming-Wei Chang, Axel Stjerngren, Josip Djolonga, Yuting Sun, Ankur Bapna, Matthew Aitchison, Pedram Pejman, Henryk Michalewski, Tianhe Yu, Cindy Wang, Juliette Love, Junwhan Ahn, Dawn Bloxwich, Kehang Han, Peter Humphreys, Thibault Sellam, James Bradbury, Varun Godbole, Sina Samangooei, Bogdan Damoc, Alex Kaskasoli, Sébastien M. R. Arnold, Vijay Vasudevan, Shubham Agrawal, Jason Riesa, Dmitry Lepikhin, Richard Tanburn, Srivatsan Srinivasan, Hyeontaek Lim, Sarah Hodkinson, Pranav Shyam, Johan Ferret, Steven Hand, Ankush Garg, Tom Le Paine, Jian Li, Yujia Li, Minh Giang, Alexander Neitz, Zaheer Abbas, Sarah York, Machel Reid, Elizabeth Cole, Aakanksha Chowdhery, Dipanjan Das, Dominika Rogozińska, Vitaliy Nikolaev, Pablo Sprechmann, Zachary Nado, Lukas Zilka, Flavien Prost, Luheng He, Marianne Monteiro, Gaurav Mishra, Chris Welty, Josh Newlan, Dawei Jia, Miltiadis Allamanis, Clara Huiyi Hu, Raoul de Liedekerke, Justin Gilmer, Carl Saroufim, Shruti Rijhwani, Shaobo Hou, Disha Shrivastava, Anirudh Baddepudi, Alex Goldin, Adnan Ozturel, Albin Cassirer, Yunhan Xu, Daniel Sohn, Devendra Sachan, Reinald Kim Amplayo, Craig Swanson, Dessie Petrova, Shashi Narayan, Arthur Guez, Siddhartha Brahma, Jessica Landon, Miteyan Patel, Ruizhe Zhao, Kevin Villela, Luyu Wang, Wenhao Jia, Matthew Rahtz, Mai Giménez, Legg Yeung, James Keeling, Petko Georgiev, Diana Mincu, Boxi Wu, Salem Haykal, Rachel Saputro, Kiran Vodrahalli, James Qin, Zeynep Cankara, Abhanshu Sharma, Nick Fernando, Will Hawkins, Behnam Neyshabur, Solomon Kim, Adrian Hutter, Priyanka Agrawal, Alex Castro-Ros, George van den Driessche, Tao Wang, Shuo-Yiin Chang, Paul Komarek, Ross Mcilroy, Mario Lučić, Guodong Zhang, Wael Farhan, Michael Sharman, Paul Natsev, Paul Michel, Yamini Bansal, Siyuan Qiao, Kris Cao, Siamak Shakeri, Christina Butterfield, Justin Chung, Paul Kishan Rubenstein, Shivani Agrawal, Arthur Mensch, Kedar Soparkar, Karel Lenc, Timothy Chung, Aedan Pope, Loren Maggiore, Jackie Kay, Priya Jhakra, Shibo Wang, Joshua Maynez, Mary Phuong, Taylor Tobin, Andrea Tacchetti, Maja Trebacz, Kevin Robinson, Yash Katariya, Sebastian Riedel, Paige Bailey, Kefan Xiao, Nimesh Ghelani, Lora Aroyo, Ambrose Slone, Neil Houlsby, Xuehan Xiong, Zhen Yang, Elena Gribovskaya, Jonas Adler, Mateo Wirth, Lisa Lee, Music Li, Thais Kagohara, Jay Pavagadhi, Sophie Bridgers, Anna Bortsova, Sanjay Ghemawat, Zafarali Ahmed, Tianqi Liu, Richard Powell, Vijay Bolina, Mariko Iinuma, Polina Zablotskaia, James Besley, Da-Woon Chung, Timothy Dozat, Ramona Comanescu, Xiance Si, Jeremy Greer, Guolong Su, Martin Polacek, Raphaël Lopez Kaufman, Simon Tokumine, Hexiang Hu, Elena Buchatskaya, Yingjie Miao, Mohamed Elhawaty, Aditya Siddhant, Nenad Tomasev, Jinwei Xing, Christina Greer, Helen Miller, Shereen Ashraf, Aurko Roy, Zizhao Zhang, Ada Ma, Angelos Filos, Milos Besta, Rory Blevins, Ted Klimenko, Chih-Kuan Yeh, Soravit Changpinyo, Jiaqi Mu, Oscar Chang, Mantas Pajarskas, Carrie Muir, Vered Cohen, Charline Le Lan, Krishna Haridasan, Amit Marathe, Steven Hansen, Sholto Douglas, Rajkumar Samuel, Mingqiu Wang, Sophia Austin, Chang Lan, Jiepu Jiang, Justin Chiu, Jaime Alonso Lorenzo, Lars Lowe Sjösund, Sébastien Cevey, Zach Gleicher, Thi Avrahami, Anudhyan Boral, Hansa Srinivasan, Vittorio Selo, Rhys May, Konstantinos Aisopos, Léonard Hussenot, Livio Baldini Soares, Kate Baumli, Michael B. Chang, Adrià Recasens, Ben Caine, Alexander Pritzel, Filip Pavetic, Fabio Pardo, Anita Gergely, Justin Frye, Vinay Ramasesh, Dan Horgan, Kartikeya Badola, Nora Kassner, Subhrajit Roy, Ethan Dyer, Víctor Campos Campos, Alex Tomala, Yunhao Tang, Dalia El Badawy, Elspeth White, Basil Mustafa, Oran Lang, Abhishek Jindal, Sharad Vikram, Zhitao Gong, Sergi Caelles, Ross Hemsley, Gregory Thornton, Fangxiaoyu Feng, Wojciech Stokowiec, Ce Zheng, Phoebe Thacker, Çağlar Ünlü, Zhishuai Zhang, Mohammad Saleh, James Svensson, Max Bileschi, Piyush Patil, Ankesh Anand, Roman Ring, Katerina Tsihlas, Arpi Vezer, Marco Selvi, Toby Shevlane, Mikel Rodriguez, Tom Kwiatkowski, Samira Daruki, Keran Rong, Allan Dafoe, Nicholas FitzGerald, Keren Gu-Lemberg, Mina Khan, Lisa Anne Hendricks, Marie Pellat, Vladimir Feinberg, James Cobon-Kerr, Tara Sainath, Maribeth Rauh, Sayed Hadi Hashemi, Richard Ives, Yana Hasson, Eric Noland, Yuan Cao, Nathan Byrd, Le Hou, Qingze Wang, Thibault Sottiaux, Michela Paganini, Jean-Baptiste Lespiau, Alexandre Moufarek, Samer Hassan, Kaushik Shivakumar, Joost van Amersfoort, Amol Mandhane, Pratik Joshi, Anirudh Goyal, Matthew Tung, Andrew Brock, Hannah Sheahan, Vedant Misra, Cheng Li, Nemanja Rakićević, Mostafa Dehghani, Fangyu Liu, Sid Mittal, Junhyuk Oh, Seb Noury, Eren Sezener, Fantine Huot, Matthew Lamm, Nicola De Cao, Charlie Chen, Sidharth Mudgal, Romina Stella, Kevin Brooks, Gautam Vasudevan, Chenxi Liu, Mainak Chain, Nivedita Melinkeri, Aaron Cohen, Venus Wang, Kristie Seymore, Sergey Zubkov, Rahul Goel, Summer Yue, Sai Krishnakumaran, Brian Albert, Nate Hurley, Motoki Sano, Anhad Mohananey, Jonah Joughin, Egor Filonov, Tomasz Kępa, Yomna Eldawy, Jiawern Lim, Rahul Rishi, Shirin Badiezadegan, Taylor Bos, Jerry Chang, Sanil Jain, Sri Gayatri Sundara Padmanabhan, Subha Puttagunta, Kalpesh Krishna, Leslie Baker, Norbert Kalb, Vamsi Bedapudi, Shuntong Lei, Anthony Yu, Oren Litvin, Xiang Zhou, Zhichun Wu, Sam Sobell, Andrea Siciliano, Alan Papir, Robby Neale, Jonas Bragagnolo, Tej Toor, Tina Chen, Valentin Anklin, Feiran Wang, Richie Feng, Milad Gholami, Kevin Ling, Lijuan Liu, Jules Walter, Hamid Moghaddam, Arun Kishore, Jakub Adamek, Tyler Mercado, Jonathan Mallinson, Siddhinita Wandekar, Stephen Cagle, Eran Ofek, Guillermo Garrido, Clemens Lombriser, Maksim Mukha, Botu Sun, Hafeezul Rahman Mohammad, Josip Matak, Yadi Qian, Vikas Peswani, Pawel Janus, Quan Yuan, Leif Schelin, Oana David, Ankur Garg, Yifan He, Oleksii Duzhyi, Anton Älgmyr, Timothée Lottaz, Qi Li, Vikas Yadav, Luyao Xu, Alex Chinien, Rakesh Shivanna, Aleksandr Chuklin, Josie Li, Carrie Spadine, Travis Wolfe, Kareem Mohamed, Subhabrata Das, Zihang Dai, Kyle He, Daniel von Dincklage, Shyam Upadhyay, Akanksha Maurya, Luyan Chi, Sebastian Krause, Khalid Salama, Pam G Rabinovitch, Pavan Kumar Reddy M, Aarush Selvan, Mikhail Dektiarev, Golnaz Ghiasi, Erdem Guven, Himanshu Gupta, Boyi Liu, Deepak Sharma, Idan Heimlich Shtacher, Shachi Paul, Oscar Akerlund, François-Xavier Aubet, Terry Huang, Chen Zhu, Eric Zhu, Elico Teixeira, Matthew Fritze, Francesco Bertolini, Liana-Eleonora Marinescu, Martin Bölle, Dominik Paulus, Khyatti Gupta, Tejasi Latkar, Max Chang, Jason Sanders, Roopa Wilson, Xuewei Wu, Yi-Xuan Tan, Lam Nguyen Thiet, Tulsee Doshi, Sid Lall, Swaroop Mishra, Wanming Chen, Thang Luong, Seth Benjamin, Jasmine Lee, Ewa Andrejczuk, Dominik Rabiej, Vipul Ranjan, Krzysztof Styrc, Pengcheng Yin, Jon Simon, Malcolm Rose Harriott, Mudit Bansal, Alexei Robsky, Geoff Bacon, David Greene, Daniil Mirylenka, Chen Zhou, Obaid Sarvana, Abhimanyu Goyal, Samuel Andermatt, Patrick Siegler, Ben Horn, Assaf Israel, Francesco Pongetti, Chih-Wei "Louis" Chen, Marco Selvatici, Pedro Silva, Kathie Wang, Jackson Tolins, Kelvin Guu, Roey Yogev, Xiaochen Cai, Alessandro Agostini, Maulik Shah, Hung Nguyen, Noah Ó Donnaile, Sébastien Pereira, Linda Friso, Adam Stambler, Adam Kurzrok, Chenkai Kuang, Yan Romanikhin, Mark Geller, ZJ Yan, Kane Jang, Cheng-Chun Lee, Wojciech Fica, Eric Malmi, Qijun Tan, Dan Banica, Daniel Balle, Ryan Pham, Yanping Huang, Diana Avram, Hongzhi Shi, Jasjot Singh, Chris Hidey, Niharika Ahuja, Pranab Saxena, Dan Dooley, Srividya Pranavi Potharaju, Eileen O'Neill, Anand Gokulchandran, Ryan Foley, Kai Zhao, Mike Dusenberry, YuAn Liu, Pulkit Mehta, Ragha Kotikalapudi, Chalence Safranek-Shrader, Andrew Goodman, Joshua Kessinger, Eran Globen, Prateek Kolhar, Chris Gorgolewski, Ali Ibrahim, Yang song, Ali Eichenbaum, Thomas Brovelli, Sahitya Potluri, Preethi Lahoti, Cip Baetu, Ali Ghorbani, Charles Chen, Andy Crawford, Shalini Pal, Mukund Sridhar, Petru Gurita, Asier Mujika, Igor Petrovski, Pierre-Louis Cedoz, Chenmei Li, Shiyuan Chen, Niccolò Dal Santo, Siddharth Goyal, Jitesh Punjabi, Karthik Kappaganthu, Chester Kwak, Pallavi LV, Sarmishta Velury, Himadri Choudhury, Jamie Hall, Premal Shah, Ricardo Figueira, Matt Thomas, Minjie Lu, Ting Zhou, Chintu Kumar, Thomas Jurdi, Sharat Chikkerur, Yenai Ma, Adams Yu, Soo Kwak, Victor Ähdel, Sujeevan Rajayogam, Travis Choma, Fei Liu, Aditya Barua, Colin Ji, Ji Ho Park, Vincent Hellendoorn, Alex Bailey, Taylan Bilal, Huanjie Zhou, Mehrdad Khatir, Charles Sutton, Wojciech Rzadkowski, Fiona Macintosh, Konstantin Shagin, Paul Medina, Jinjing Zhou, Pararth Shah, Yingying Bi, Attila Dankovics, Shipra Banga, Sabine Lehmann, Marissa Bredesen, Zifan Lin, John Eric Hoffmann, Jonathan Lai, Raynald Chung, Kai Yang, Nihal Balani, Arthur Bražinskas, Andrei Sozanschi, Matthew Hayes, Héctor Fernández Alcalde, Peter Makarov, Will Chen, Antonio Stella, Liselotte Snijders, Michael Mandl, Ante Kärrman, Paweł Nowak, Xinyi Wu, Alex Dyck, Krishnan Vaidyanathan, Raghavender R, Jessica Mallet, Mitch Rudominer, Eric Johnston, Sushil Mittal, Akhil Udathu, Janara Christensen, Vishal Verma, Zach Irving, Andreas Santucci, Gamaleldin Elsayed, Elnaz Davoodi, Marin Georgiev, Ian Tenney, Geoffrey Cideron, Edouard Leurent, Mahmoud Alnahlawi, Ionut Georgescu, Nan Wei, Ivy Zheng, Dylan Scandinaro, Heinrich Jiang, Jasper Snoek, Mukund Sundararajan, Xuezhi Wang, Zack Ontiveros, Itay Karo, Jeremy Cole, Vinu Rajashekhar, Lara Tumeh, Eyal Ben-David, Rishub Jain, Jonathan Uesato, Romina Datta, Oskar Bunyan, Shimu Wu, John Zhang, Piotr Stanczyk, Ye Zhang, David Steiner, Subhajit Naskar, Michael Azzam, Matthew Johnson, Adam Paszke, Chung-Cheng Chiu, Jaume Sanchez Elias, Afroz Mohiuddin, Faizan Muhammad, Jin Miao, Andrew Lee, Nino Vieillard, Jane Park, Jiageng Zhang, Jeff Stanway, Drew Garmon, Abhijit Karmarkar, Zhe Dong, Jong Lee, Aviral Kumar, Luowei Zhou, Jonathan Evens, William Isaac, Geoffrey Irving, Edward Loper, Michael Fink, Isha Arkatkar, Nanxin Chen, Izhak Shafran, Ivan Petrychenko, Zhe Chen, Johnson Jia, Anselm Levskaya, Zhenkai Zhu, Peter Grabowski, Yu Mao, Alberto Magni, Kaisheng Yao, Javier Snaider, Norman Casagrande, Evan Palmer, Paul Suganthan, Alfonso Castaño, Irene Giannoumis, Wooyeol Kim, Mikołaj Rybiński, Ashwin Sreevatsa, Jennifer Prendki, David Soergel, Adrian Goedeckemeyer, Willi Gierke, Mohsen Jafari, Meenu Gaba, Jeremy Wiesner, Diana Gage Wright, Yawen Wei, Harsha Vashisht, Yana Kulizhskaya, Jay Hoover, Maigo Le, Lu Li, Chimezie Iwuanyanwu, Lu Liu, Kevin Ramirez, Andrey Khorlin, Albert Cui, Tian Lin, Marcus Wu, Ricardo Aguilar, Keith Pallo, Abhishek Chakladar, Ginger Perng, Elena Allica Abellan, Mingyang Zhang, Ishita Dasgupta, Nate Kushman, Ivo Penchev, Alena Repina, Xihui Wu, Tom van der Weide, Priya Ponnapalli, Caroline Kaplan, Jiri Simsa, Shuangfeng Li, Olivier Dousse, Jeff Piper, Nathan Ie, Rama Pasumarthi, Nathan Lintz, Anitha Vijayakumar, Daniel Andor, Pedro Valenzuela, Minnie Lui, Cosmin Paduraru, Daiyi Peng, Katherine Lee, Shuyuan Zhang, Somer Greene, Duc Dung Nguyen, Paula Kurylowicz, Cassidy Hardin, Lucas Dixon, Lili Janzer, Kiam Choo, Ziqiang Feng, Biao Zhang, Achintya Singhal, Dayou Du, Dan McKinnon, Natasha Antropova, Tolga Bolukbasi, Orgad Keller, David Reid, Daniel Finchelstein, Maria Abi Raad, Remi Crocker, Peter Hawkins, Robert Dadashi, Colin Gaffney, Ken Franko, Anna Bulanova, Rémi Leblond, Shirley Chung, Harry Askham, Luis C. Cobo, Kelvin Xu, Felix Fischer, Jun Xu, Christina Sorokin, Chris Alberti, Chu-Cheng Lin, Colin Evans, Alek Dimitriev, Hannah Forbes, Dylan Banarse, Zora Tung, Mark Omernick, Colton Bishop, Rachel Sterneck, Rohan Jain, Jiawei Xia, Ehsan Amid, Francesco Piccinno, Xingyu Wang, Praseem Banzal, Daniel J. Mankowitz, Alex Polozov, Victoria Krakovna, Sasha Brown, Mohammadhossein Bateni, Dennis Duan, Vlad Firoiu, Meghana Thotakuri, Tom Natan, Matthieu Geist, Ser tan Girgin, Hui Li, Jiayu Ye, Ofir Roval, Reiko Tojo, Michael Kwong, James Lee-Thorp, Christopher Yew, Danila Sinopalnikov, Sabela Ramos, John Mellor, Abhishek Sharma, Kathy Wu, David Miller, Nicolas Sonnerat, Denis Vnukov, Rory Greig, Jennifer Beattie, Emily Caveness, Libin Bai, Julian Eisenschlos, Alex Korchemniy, Tomy Tsai, Mimi Jasarevic, Weize Kong, Phuong Dao, Zeyu Zheng, Frederick Liu, Fan Yang, Rui Zhu, Tian Huey Teh, Jason Sanmiya, Evgeny Gladchenko, Nejc Trdin, Daniel Toyama, Evan Rosen, Sasan Tavakkol, Linting Xue, Chen Elkind, Oliver Woodman, John Carpenter, George Papamakarios, Rupert Kemp, Sushant Kafle, Tanya Grunina, Rishika Sinha, Alice Talbert, Diane Wu, Denese Owusu-Afriyie, Cosmo Du, Chloe Thornton, Jordi Pont-Tuset, Pradyumna Narayana, Jing Li, Saaber Fatehi, John Wieting, Omar Ajmeri, Benigno Uria, Yeongil Ko, Laura Knight, Amélie Héliou, Ning Niu, Shane Gu, Chenxi Pang, Yeqing Li, Nir Levine, Ariel Stolovich, Rebeca Santamaria-Fernandez, Sonam Goenka, Wenny Yustalim, Robin Strudel, Ali Elqursh, Charlie Deck, Hyo Lee, Zonglin Li, Kyle Levin, Raphael Hoffmann, Dan Holtmann-Rice, Olivier Bachem, Sho Arora, Christy Koh, Soheil Hassas Yeganeh, Siim Põder, Mukarram Tariq, Yanhua Sun, Lucian Ionita, Mojtaba Seyedhosseini, Pouya Tafti, Zhiyu Liu, Anmol Gulati, Jasmine Liu, Xinyu Ye, Bart Chrzaszcz, Lily Wang, Nikhil Sethi, Tianrun Li, Ben Brown, Shreya Singh, Wei Fan, Aaron Parisi, Joe Stanton, Vinod Koverkathu, Christopher A. Choquette-Choo, Yunjie Li, TJ Lu, Abe Ittycheriah, Prakash Shroff, Mani Varadarajan, Sanaz Bahargam, Rob Willoughby, David Gaddy, Guillaume Desjardins, Marco Cornero, Brona Robenek, Bhavishya Mittal, Ben Albrecht, Ashish Shenoy, Fedor Moiseev, Henrik Jacobsson, Alireza Ghaffarkhah, Morgane Rivière, Alanna Walton, Clément Crepy, Alicia Parrish, Zongwei Zhou, Clement Farabet, Carey Radebaugh, Praveen Srinivasan, Claudia van der Salm, Andreas Fidjeland, Salvatore Scellato, Eri Latorre-Chimoto, Hanna Klimczak-Plucińska, David Bridson, Dario de Cesare, Tom Hudson, Piermaria Mendolicchio, Lexi Walker, Alex Morris, Matthew Mauger, Alexey Guseynov, Alison Reid, Seth Odoom, Lucia Loher, Victor Cotruta, Madhavi Yenugula, Dominik Grewe, Anastasia Petrushkina, Tom Duerig, Antonio Sanchez, Steve Yadlowsky, Amy Shen, Amir Globerson, Lynette Webb, Sahil Dua, Dong Li, Surya Bhupatiraju, Dan Hurt, Haroon Qureshi, Ananth Agarwal, Tomer Shani, Matan Eyal, Anuj Khare, Shreyas Rammohan Belle, Lei Wang, Chetan Tekur, Mihir Sanjay Kale, Jinliang Wei, Ruoxin Sang, Brennan Saeta, Tyler Liechty, Yao Zhao, Stephan Lee, Pandu Nayak, Doug Fritz, Manish Reddy Vuyyuru, John Aslanides, Nidhi Vyas, Martin Wicke, Xiao Ma, Evgenii Eltyshev, Nina Martin, Hardie Cate, James Manyika, Keyvan Amiri, Yelin Kim, Xi Xiong, Kai Kang, Florian Luisier, Nilesh Tripuraneni, David Madras, Mandy Guo, Austin Waters, Oliver Wang, Joshua Ainslie, Jason Baldridge, Han Zhang, Garima Pruthi, Jakob Bauer, Feng Yang, Riham Mansour, Jason Gelman, Yang Xu, George Polovets, Ji Liu, Honglong Cai, Warren Chen, XiangHai Sheng, Emily Xue, Sherjil Ozair, Christof Angermueller, Xiaowei Li, Anoop Sinha, Weiren Wang, Julia Wiesinger, Emmanouil Koukoumidis, Yuan Tian, Anand Iyer, Madhu Gurumurthy, Mark Goldenson, Parashar Shah, MK Blake, Hongkun Yu, Anthony Urbanowicz, Jennimaria Palomaki, Chrisantha Fernando, Ken Durden, Harsh Mehta, Nikola Momchev, Elahe Rahimtoroghi, Maria Georgaki, Amit Raul, Sebastian Ruder, Morgan Redshaw, Jinhyuk Lee, Denny Zhou, Komal Jalan, Dinghua Li, Blake Hechtman, Parker Schuh, Milad Nasr, Kieran Milan, Vladimir Mikulik, Juliana Franco, Tim Green, Nam Nguyen, Joe Kelley, Aroma Mahendru, Andrea Hu, Joshua Howland, Ben Vargas, Jeffrey Hui, Kshitij Bansal, Vikram Rao, Rakesh Ghiya, Emma Wang, Ke Ye, Jean Michel Sarr, Melanie Moranski Preston, Madeleine Elish, Steve Li, Aakash Kaku, Jigar Gupta, Ice Pasupat, Da-Cheng Juan, Milan Someswar, Tejvi M., Xinyun Chen, Aida Amini, Alex Fabrikant, Eric Chu, Xuanyi Dong, Amruta Muthal, Senaka Buthpitiya, Sarthak Jauhari, Nan Hua, Urvashi Khandelwal, Ayal Hitron, Jie Ren, Larissa Rinaldi, Shahar Drath, Avigail Dabush, Nan-Jiang Jiang, Harshal Godhia, Uli Sachs, Anthony Chen, Yicheng Fan, Hagai Taitelbaum, Hila Noga, Zhuyun Dai, James Wang, Chen Liang, Jenny Hamer, Chun-Sung Ferng, Chenel Elkind, Aviel Atias, Paulina Lee, Vít Listík, Mathias Carlen, Jan van de Kerkhof, Marcin Pikus, Krunoslav Zaher, Paul Müller, Sasha Zykova, Richard Stefanec, Vitaly Gatsko, Christoph Hirnschall, Ashwin Sethi, Xingyu Federico Xu, Chetan Ahuja, Beth Tsai, Anca Stefanoiu, Bo Feng, Keshav Dhandhania, Manish Katyal, Akshay Gupta, Atharva Parulekar, Divya Pitta, Jing Zhao, Vivaan Bhatia, Yashodha Bhavnani, Omar Alhadlaq, Xiaolin Li, Peter Danenberg, Dennis Tu, Alex Pine, Vera Filippova, Abhipso Ghosh, Ben Limonchik, Bhargava Urala, Chaitanya Krishna Lanka, Derik Clive, Yi Sun, Edward Li, Hao Wu, Kevin Hongtongsak, Ianna Li, Kalind Thakkar, Kuanysh Omarov, Kushal Majmundar, Michael Alverson, Michael Kucharski, Mohak Patel, Mudit Jain, Maksim Zabelin, Paolo Pelagatti, Rohan Kohli, Saurabh Kumar, Joseph Kim, Swetha Sankar, Vineet Shah, Lakshmi Ramachandruni, Xiangkai Zeng, Ben Bariach, Laura Weidinger, Amar Subramanya, Sissie Hsiao, Demis Hassabis, Koray Kavukcuoglu, Adam Sadovsky, Quoc Le, Trevor Strohman, Yonghui Wu, Slav Petrov, Jeffrey Dean, Oriol Vinyals

This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding.

Ranked #1 on Multi-task Language Understanding on MMLU (using extra training data)

Arithmetic Reasoning Code Generation +3

Paper
Add Code

Roll With the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning

1 code implementation • 19 Dec 2023 • Yue Duan, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

While semi-supervised learning (SSL) has yielded promising results, the more realistic SSL scenario remains to be explored, in which the unlabeled data exhibits extremely high recognition difficulty, e. g., fine-grained visual classification in the context of SSL (SS-FGVC).

Fine-Grained Image Classification Pseudo Label

Paper
Code

Federated Learning with Instance-Dependent Noisy Label

no code implementations • 16 Dec 2023 • Lei Wang, Jieming Bian, Jie Xu

We introduce a novel algorithm called FedBeat (Federated Learning with Bayesian Ensemble-Assisted Transition Matrix Estimation).

Federated Learning

Paper
Add Code

Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

1 code implementation • 4 Dec 2023 • Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim

The fine-grained object attributes and behaviors non-existent in the image may still be generated but not measured by the current evaluation methods.

Hallucination Hallucination Evaluation +2

Paper
Code

Learning with Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias Calibration

no code implementations • 27 Nov 2023 • Lei Wang, Qingbo Wu, Desen Yuan, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu

Learning based image quality assessment (IQA) models have obtained impressive performance with the help of reliable subjective quality labels, where mean opinion score (MOS) is the most popular choice.

Image Quality Assessment

Paper
Add Code

SpliceMix: A Cross-scale and Semantic Blending Augmentation Strategy for Multi-label Image Classification

1 code implementation • 26 Nov 2023 • Lei Wang, Yibing Zhan, Leilei Ma, Dapeng Tao, Liang Ding, Chen Gong

The "splice" in our method is two-fold: 1) Each mixed image is a splice of several downsampled images in the form of a grid, where the semantics of images attending to mixing are blended without object deficiencies for alleviating co-occurred bias; 2) We splice mixed images and the original mini-batch to form a new SpliceMixed mini-batch, which allows an image with different scales to contribute to training together.

Data Augmentation Multi-Label Image Classification

Paper
Code

HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition

no code implementations • 19 Nov 2023 • Lei Wang, Yinchi Ma, Peng Luan, Wei Yao, CongCong Li, Bo Liu

Gait recognition has achieved promising advances in controlled settings, yet it significantly struggles in unconstrained environments due to challenges such as view changes, occlusions, and varying walking speeds.

Gait Recognition

Paper
Add Code

CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

no code implementations • 6 Nov 2023 • Jieming Bian, Lei Wang, Shaolei Ren, Jie Xu

Training large-scale artificial intelligence (AI) models demands significant computational power and energy, leading to increased carbon footprint with potential environmental repercussions.

Federated Learning

Paper
Add Code

A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis

no code implementations • 31 Oct 2023 • Yingshu Li, Yunyi Liu, Zhanyu Wang, Xinyu Liang, Lei Wang, Lingqiao Liu, Leyang Cui, Zhaopeng Tu, Longyue Wang, Luping Zhou

This work conducts an evaluation of GPT-4V's multimodal capability for medical image analysis, with a focus on three representative tasks of radiology report generation, medical visual question answering, and medical visual grounding.

Descriptive Medical Visual Question Answering +3

Paper
Add Code

R$^3$ Prompting: Review, Rephrase and Resolve for Chain-of-Thought Reasoning in Large Language Models under Noisy Context

no code implementations • 25 Oct 2023 • Qingyuan Tian, Hanlun Zhu, Lei Wang, Yang Li, Yunshi Lan

More analyses and ablation studies show the robustness and generalization of R$^3$ prompting method in solving reasoning tasks in LLMs under noisy context.

Sentence

Paper
Add Code

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

1 code implementation • 23 Oct 2023 • Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang

To achieve this goal, we adopt Avalon, a representative communication game, as the environment and use system prompts to guide LLM agents to play the game.

Paper
Code

Distributed Adaptive Time-Varying Convex Optimization for Multi-agent Systems

no code implementations • 20 Oct 2023 • Liangze Jiang, Zhengguang Wu, Lei Wang

A new class of adaptive algorithms are proposed to solve time-varying convex optimization problems.

Paper
Add Code

Flow Dynamics Correction for Action Recognition

no code implementations • 16 Oct 2023 • Lei Wang, Piotr Koniusz

Various research studies indicate that action recognition performance highly depends on the types of motions being extracted and how accurate the human actions are represented.

Fine-grained Action Recognition Hallucination +1

Paper
Add Code

HeightFormer: A Multilevel Interaction and Image-adaptive Classification-regression Network for Monocular Height Estimation with Aerial Images

no code implementations • 12 Oct 2023 • Zhan Chen, Yidan Zhang, Xiyu Qi, Yongqiang Mao, Xin Zhou, Lulu Niu, Hui Wu, Lei Wang, Yunping Ge

MIB supplements the fixed sample grid in CNN of the conventional backbone network with tokens of different interaction ranges.

Autonomous Driving regression +1

Paper
Add Code

Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation

no code implementations • 12 Oct 2023 • Yuanyuan Liang, Jianing Wang, Hanlun Zhu, Lei Wang, Weining Qian, Yunshi Lan

Inspired by Chain-of-Thought (CoT) prompting, which is an in-context learning strategy for reasoning, we formulate KBQG task as a reasoning problem, where the generation of a complete question is splitted into a series of sub-question generation.

In-Context Learning Question Generation +1

Paper
Add Code

LLM4Vis: Explainable Visualization Recommendation using ChatGPT

1 code implementation • 11 Oct 2023 • Lei Wang, Songheng Zhang, Yun Wang, Ee-Peng Lim, Yong Wang

To obtain demonstration examples with high-quality explanations, we propose a new explanation generation bootstrapping to iteratively refine generated explanations by considering the previous generation and template-based hint.

Data Visualization Explanation Generation

Paper
Code

Adaptive Multi-head Contrastive Learning

no code implementations • 9 Oct 2023 • Lei Wang, Piotr Koniusz, Tom Gedeon, Liang Zheng

As such, enforcing a high similarity for positive pairs and a low similarity for negative pairs may not always be achievable, and in the case of some pairs, forcing so may be detrimental to the performance.

Contrastive Learning

Paper
Add Code

Reach-avoid Analysis for Sampled-data Systems with Measurement Uncertainties

no code implementations • 8 Oct 2023 • Taoran Wu, Dejin Ren, Shuyuan Zhang, Lei Wang, Bai Xue

Digital control has become increasingly prevalent in modern systems, making continuous-time plants controlled by discrete-time (digital) controllers ubiquitous and crucial across industries, including aerospace, automotive, and manufacturing.

Paper
Add Code

AI in Software Engineering: Case Studies and Prospects

no code implementations • 27 Sep 2023 • Lei Wang

Based on the analysis of both case studies, using AI techniques such as deep learning and machine learning in software systems contributes to intelligent systems.

Decision Making

Paper
Add Code

FlaCGEC: A Chinese Grammatical Error Correction Dataset with Fine-grained Linguistic Annotation

1 code implementation • 26 Sep 2023 • Hanyue Du, Yike Zhao, Qingyuan Tian, Jiani Wang, Lei Wang, Yunshi Lan, Xuesong Lu

Chinese Grammatical Error Correction (CGEC) has been attracting growing attention from researchers recently.

Grammatical Error Correction

Paper
Code

R2GenGPT: Radiology Report Generation with Frozen LLMs

1 code implementation • 18 Sep 2023 • Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou

First, it attains state-of-the-art (SOTA) performance by training only the lightweight visual alignment module while freezing all the parameters of LLM.

Paper
Code

Differentially Private Average Consensus with Improved Accuracy-Privacy Trade-off

no code implementations • 15 Sep 2023 • Lei Wang, Weijia Liu, Fanghong Guo, Zixin Qiao, Zhengguang Wu

Gaussian noise and the output of the mechanism using Gaussian noises, it is shown that the resulting average consensus algorithm can eliminate the gap in the sense that the accuracy-privacy trade-off of the centralized averaging approach with differential privacy can be almost recovered by appropriately designing the variances of the added noises.

Paper
Add Code

Beamforming Design and Performance Evaluation for RIS-aided Localization using LEO Satellite Signals

no code implementations • 13 Sep 2023 • Lei Wang, Pinjun Zheng, Xing Liu, Tarig Ballal, Tareq Y. Al-Naffouri

The growing availability of low-Earth orbit (LEO) satellites, coupled with the anticipated widespread deployment of reconfigurable intelligent surfaces (RISs), opens up promising prospects for new localization paradigms.

Paper
Add Code

Enhancing Sample Utilization through Sample Adaptive Augmentation in Semi-Supervised Learning

1 code implementation • ICCV 2023 • Guan Gui, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

Sample adaptive augmentation (SAA) is proposed for this stated purpose and consists of two modules: 1) sample selection module; 2) sample augmentation module.

Paper
Code

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors

1 code implementation • 26 Aug 2023 • Chengkun Wei, Wenlong Meng, Zhikun Zhang, Min Chen, Minghu Zhao, Wenjing Fang, Lei Wang, Zihui Zhang, Wenzhi Chen

Instead of directly inverting the triggers, LMSanitator aims to invert the predefined attack vectors (pretrained models' output when the input is embedded with triggers) of the task-agnostic backdoors, which achieves much better convergence performance and backdoor detection accuracy.

Paper
Code

Head-Tail Cooperative Learning Network for Unbiased Scene Graph Generation

1 code implementation • 23 Aug 2023 • Lei Wang, Zejian yuan, Yao Lu, Badong Chen

We also propose a self-supervised learning approach to enhance the prediction ability of the tail-prefer feature representation branch by constraining tail-prefer predicate features.

Graph Generation Self-Supervised Learning +1

Paper
Code

A Survey on Large Language Model based Autonomous Agents

2 code implementations • 22 Aug 2023 • Lei Wang, Chen Ma, Xueyang Feng, Zeyu Zhang, Hao Yang, Jingsen Zhang, ZhiYuan Chen, Jiakai Tang, Xu Chen, Yankai Lin, Wayne Xin Zhao, Zhewei Wei, Ji-Rong Wen

In this paper, we present a comprehensive survey of these studies, delivering a systematic review of the field of LLM-based autonomous agents from a holistic perspective.

Language Modelling Large Language Model

2,217

Paper
Code

Towards Semi-supervised Learning with Non-random Missing Labels

2 code implementations • ICCV 2023 • Yue Duan, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

Semi-supervised learning (SSL) tackles the label missing problem by enabling the effective usage of unlabeled data.

Missing Labels Semi-Supervised Image Classification

Paper
Code

MoCoSA: Momentum Contrast for Knowledge Graph Completion with Structure-Augmented Pre-trained Language Models

no code implementations • 16 Aug 2023 • Jiabang He, Liu Jia, Lei Wang, Xiyao Li, Xing Xu

However, they struggle with semantically rich real-world entities due to limited structural information and fail to generalize to unseen entities.

Ranked #1 on Link Prediction on WN18RR

Entity Embeddings Link Prediction

Paper
Add Code

Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields

no code implementations • 11 Aug 2023 • Yatao Li, Wanling Gao, Lei Wang, Lixin Sun, Zun Wang, Jianfeng Zhan

This suite of metrics has demonstrated a better ability to assess a model's performance in real-world scientific applications, in contrast to traditional AI benchmarking methodologies.

Benchmarking

Paper
Add Code

Empower Your Model with Longer and Better Context Comprehension

1 code implementation • 25 Jul 2023 • YiFei Gao, Lei Wang, Jun Fang, Longhua Hu, Jun Cheng

Recently, with the emergence of numerous Large Language Models (LLMs), the implementation of AI has entered a new era.

Paper
Code

Semantic-Aware Dual Contrastive Learning for Multi-label Image Classification

1 code implementation • 19 Jul 2023 • Leilei Ma, Dengdi Sun, Lei Wang, Haifeng Zhao, Bin Luo

Specifically, we leverage semantic-aware representation learning to extract category-related local discriminative features and construct category prototypes.

Ranked #1 on Multi-Label Learning on COCO 2014

Contrastive Learning Multi-Label Image Classification +2

Paper
Code

Hierarchical Spatio-Temporal Representation Learning for Gait Recognition

no code implementations • ICCV 2023 • Lei Wang, Bo Liu, Fangfang Liang, Bincheng Wang

While current methods focus on exploiting body part-based representations, they often neglect the hierarchical dependencies between local motion patterns.

Gait Recognition Representation Learning

Paper
Add Code

IterLara: A Turing Complete Algebra for Big Data, AI, Scientific Computing, and Database

no code implementations • 17 Jul 2023 • Hongxiao Li, Wanling Gao, Lei Wang, Jianfeng Zhan

The study of \textsc{Lara}'s expressive ability reports that it can represent relational algebra and most linear algebra operations.

Paper
Add Code

In-context Autoencoder for Context Compression in a Large Language Model

1 code implementation • 13 Jul 2023 • Tao Ge, Jing Hu, Lei Wang, Xun Wang, Si-Qing Chen, Furu Wei

We propose the In-context Autoencoder (ICAE), leveraging the power of a large language models (LLM) to compress a long context into short compact memory slots that can be directly conditioned on by the LLM for various purposes.

Language Modelling Large Language Model +3

Paper
Code

DBFed: Debiasing Federated Learning Framework based on Domain-Independent

no code implementations • 10 Jul 2023 • Jiale Li, Zhixin Li, Yibo Wang, Yao Li, Lei Wang

However, it brings challenges in information security and data security.

Fairness Federated Learning +1

Paper
Add Code

An Empirical Study on the Holiday Effect of China's Time-Honored Companies

no code implementations • 29 Jun 2023 • Xianyang Li, Jiayi Xu, Haoxuan Xu, Yunxuan Ma, Yu Zhong, Lei Wang

The stock segment of China's time-honored brand enterprises has an important position in our securities stock market.

Paper
Add Code

PEBO-SLAM: Observer design for visual inertial SLAM with convergence guarantees

no code implementations • 22 Jun 2023 • Bowen Yi, Chi Jin, Lei Wang, Guodong Shi, Viorela Ila, Ian R. Manchester

This paper introduces a new linear parameterization to the problem of visual inertial simultaneous localization and mapping (VI-SLAM) -- without any approximation -- for the case only using information from a single monocular camera and an inertial measurement unit.

Simultaneous Localization and Mapping

Paper
Add Code

D3L: Decomposition of 3D Rotation and Lift from 2D Joint to 3D for Human Mesh Recovery

no code implementations • 10 Jun 2023 • Xiaoyang Hao, Han Li, Jun Cheng, Lei Wang

However, these methods present rotation semantic ambiguity, rotation error accumulation, and shape estimation overfitting, which also leads to errors in the estimated pose.

Human Mesh Recovery Pose Estimation +1

Paper
Add Code

User Behavior Simulation with Large Language Model based Agents

1 code implementation • 5 Jun 2023 • Lei Wang, Jingsen Zhang, Hao Yang, ZhiYuan Chen, Jiakai Tang, Zeyu Zhang, Xu Chen, Yankai Lin, Ruihua Song, Wayne Xin Zhao, Jun Xu, Zhicheng Dou, Jun Wang, Ji-Rong Wen

Simulating high quality user behavior data has always been a fundamental problem in human-centered applications, where the major difficulty originates from the intricate mechanism of human decision process.

Language Modelling Large Language Model +2

206

Paper
Code

Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models

1 code implementation • 5 Jun 2023 • Jiabang He, Yi Hu, Lei Wang, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen

Results from the experiments demonstrate that there is a significant performance gap between the in-distribution (ID) and OOD settings for document images, and that fine-grained analysis of distribution shifts can reveal the brittle nature of existing pre-trained VDU models and OOD generalization algorithms.

document understanding Question Answering

Paper
Code

SAPI: Surroundings-Aware Vehicle Trajectory Prediction at Intersections

no code implementations • 2 Jun 2023 • Ethan Zhang, Hao Xiao, Yiqian Gan, Lei Wang

In this work we propose a deep learning model, i. e., SAPI, to predict vehicle trajectories at intersections.

Autonomous Vehicles Trajectory Prediction

Paper
Add Code

ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection

no code implementations • CVPR 2023 • Shiwei Jin, Zhen Wang, Lei Wang, Ning Bi, Truong Nguyen

Then both the initial and edited embeddings are projected back (deprojected) to the initial latent space as residuals to modify the input latent vectors by subtraction and addition, representing old status removal and new status addition.

Attribute Gaze Estimation +2

Paper
Add Code

MALM: Mask Augmentation based Local Matching for Food-Recipe Retrieval

1 code implementation • 18 May 2023 • Bhanu Prakash Voutharoja, Peng Wang, Lei Wang, Vivienne Guan

A de-facto idea to address this task is to learn a shared feature embedding space in which a food image is aligned better to its paired recipe than other recipes.

Image-text matching Retrieval +1

Paper
Code

Automatic Radiology Report Generation by Learning with Increasingly Hard Negatives

1 code implementation • 11 May 2023 • Bhanu Prakash Voutharoja, Lei Wang, Luping Zhou

At each iteration, conditioned on a given set of hard negative reports, image and report features are learned as usual by minimising the loss functions related to report generation.

Medical Report Generation

Paper
Code

Read, Diagnose and Chat: Towards Explainable and Interactive LLMs-Augmented Depression Detection in Social Media

no code implementations • 9 May 2023 • Wei Qin, Zetong Chen, Lei Wang, Yunshi Lan, Weijieying Ren, Richang Hong

This paper proposes a new depression detection system based on LLMs that is both interpretable and interactive.

Depression Detection

Paper
Add Code

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure

1 code implementation • 8 May 2023 • Yi Bin, Mengqun Han, Wenhao Shi, Lei Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen

For evaluating the possible expression variants, we design a path-based metric to evaluate the partial accuracy of expressions of a unified tree.

Math valid

Paper
Code

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

3 code implementations • 6 May 2023 • Lei Wang, Wanyu Xu, Yihuai Lan, Zhiqiang Hu, Yunshi Lan, Roy Ka-Wei Lee, Ee-Peng Lim

To address the calculation errors and improve the quality of generated reasoning steps, we extend PS prompting with more detailed instructions and derive PS+ prompting.

Math

8,384

Paper
Code

T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Mixed Large Language Model Signals for Science Question Answering

1 code implementation • 5 May 2023 • Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen

To address these issues, we propose a novel method termed T-SciQ that aims at teaching science question answering with LLM signals.

Language Modelling Large Language Model +1

Paper
Code

Stars Are All You Need: A Distantly Supervised Pyramid Network for Unified Sentiment Analysis

no code implementations • 2 May 2023 • Wenchang Li, Yixing Chen, Shuang Zheng, Lei Wang, John P. Lalor

We also demonstrate the interpretability of DSPN's outputs on reviews to show the pyramid structure inherent in unified sentiment analysis.

Aspect Category Detection Aspect Category Sentiment Analysis +1

Paper
Add Code

Learning Partial Correlation based Deep Visual Representation for Image Classification

1 code implementation • CVPR 2023 • Saimunur Rahman, Piotr Koniusz, Lei Wang, Luping Zhou, Peyman Moghadam, Changming Sun

Our work obtains a partial correlation based deep visual representation and mitigates the small sample problem often encountered by covariance matrix estimation in CNN.

Fine-Grained Image Classification

Paper
Code

Accelerating Hybrid Federated Learning Convergence under Partial Participation

no code implementations • 10 Apr 2023 • Jieming Bian, Lei Wang, Kun Yang, Cong Shen, Jie Xu

In this paper, we provide theoretical analysis of hybrid FL under clients' partial participation to validate that partial participation is the key constraint on convergence speed.

Federated Learning

Paper
Add Code

Zero-Shot Next-Item Recommendation using Large Pretrained Language Models

1 code implementation • 6 Apr 2023 • Lei Wang, Ee-Peng Lim

Large language models (LLMs) have achieved impressive zero-shot performance in various natural language processing (NLP) tasks, demonstrating their capabilities for inference without training examples.

Sequential Recommendation

113

Paper
Code

METransformer: Radiology Report Generation by Transformer with Multiple Learnable Expert Tokens

no code implementations • CVPR 2023 • Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou

In the encoder, each expert token interacts with both vision tokens and other expert tokens to learn to attend different image regions for image representation.

Paper
Add Code

LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models

2 code implementations • 4 Apr 2023 • Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee-Peng Lim, Lidong Bing, Xing Xu, Soujanya Poria, Roy Ka-Wei Lee

The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e. g., ChatDoctor) or instruction data (e. g., Alpaca).

Arithmetic Reasoning Language Modelling

932

Paper
Code

3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition

no code implementations • CVPR 2023 • Lei Wang, Piotr Koniusz

We split action sequences into temporal blocks, Higher-order Transformer (HoT) produces embeddings of each temporal block based on (i) the body joints, (ii) pairwise links of body joints and (iii) higher-order hyper-edges of skeleton body joints.

Action Recognition Skeleton Based Action Recognition

Paper
Add Code

Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs

no code implementations • 24 Mar 2023 • Taiqiang Wu, Zhe Zhao, Jiahao Wang, Xingyu Bai, Lei Wang, Ngai Wong, Yujiu Yang

Distilling high-accuracy Graph Neural Networks~(GNNs) to low-latency multilayer perceptrons~(MLPs) on graph tasks has become a hot research topic.

Knowledge Distillation

Paper
Add Code

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction

1 code implementation • ICCV 2023 • Jiabang He, Lei Wang, Yi Hu, Ning Liu, Hui Liu, Xing Xu, Heng Tao Shen

To this end, we propose a simple but effective in-context learning framework called ICL-D3IE, which enables LLMs to perform DIE with different types of demonstration examples.

Document AI In-Context Learning

Paper
Code

REASONER: An Explainable Recommendation Dataset with Multi-aspect Real User Labeled Ground Truths Towards more Measurable Explainable Recommendation

no code implementations • 1 Mar 2023 • Xu Chen, Jingsen Zhang, Lei Wang, Quanyu Dai, Zhenhua Dong, Ruiming Tang, Rui Zhang, Li Chen, Ji-Rong Wen

To alleviate the above problems, we propose to build an explainable recommendation dataset with multi-aspect real user labeled ground truths.

Explainable Recommendation Informativeness +1

Paper
Add Code

MonoPGC: Monocular 3D Object Detection with Pixel Geometry Contexts

no code implementations • 21 Feb 2023 • Zizhang Wu, Yuanzhu Gan, Lei Wang, Guilian Chen, Jian Pu

Monocular 3D object detection reveals an economical but challenging task in autonomous driving.

Autonomous Driving Depth Estimation +3

Paper
Add Code

MVFusion: Multi-View 3D Object Detection with Semantic-aligned Radar and Camera Fusion

no code implementations • 21 Feb 2023 • Zizhang Wu, Guilian Chen, Yuanzhu Gan, Lei Wang, Jian Pu

To achieve so, we inject the semantic alignment into the radar features via the semantic-aligned radar encoder (SARE) to produce image-guided radar features.

Ranked #7 on 3D Object Detection on nuscenes Camera-Radar

3D Object Detection Autonomous Driving +1

Paper
Add Code

CMLCompiler: A Unified Compiler for Classical Machine Learning

no code implementations • 31 Jan 2023 • Xu Wen, Wanling Gao, Anzheng Li, Lei Wang, Zihan Jiang, Jianfeng Zhan

Without a unified framework, the hybrid deployments of deep learning (DL) and CML also suffer from severe performance and portability issues.

Paper
Add Code

High-level semantic feature matters few-shot unsupervised domain adaptation

no code implementations • 5 Jan 2023 • Lei Yu, Wanqi Yang, Shengqi Huang, Lei Wang, Ming Yang

However, the goal of FS-UDA and FSL are relevant yet distinct, since FS-UDA aims to classify the samples in target domain rather than source domain.

Few-Shot Learning Unsupervised Domain Adaptation +1

Paper
Add Code

A GOA-Based Fault-Tolerant Trajectory Tracking Control for an Underwater Vehicle of Multi-Thruster System without Actuator Saturation

no code implementations • 4 Jan 2023 • Danjie Zhu, Lei Wang, Hua Zhang, Simon X. Yang

This paper proposes an intelligent fault-tolerant control (FTC) strategy to tackle the trajectory tracking problem of an underwater vehicle (UV) under thruster damage (power loss) cases and meanwhile resolve the actuator saturation brought by the vehicle's physical constraints.

Paper
Add Code

Regularized Primitive Graph Learning for Unified Vector Mapping

no code implementations • ICCV 2023 • Lei Wang, Min Dai, Jianan He, Jingwei Huang

Our key idea is using primitive graph as a unified representation of vector maps and formulating shape regularization and topology reconstruction as primitive graph reconstruction problems that can be solved in the same framework.

Graph Learning Graph Reconstruction

Paper
Add Code

Learning Spatial-context-aware Global Visual Feature Representation for Instance Image Retrieval

1 code implementation • ICCV 2023 • Zhongyan Zhang, Lei Wang, Luping Zhou, Piotr Koniusz

To this end, we propose a novel feature learning framework for instance image retrieval, which embeds local spatial context information into the learned global feature representations.

Image Retrieval Retrieval

Paper
Code

Quality at the Tail of Machine Learning Inference

no code implementations • 25 Dec 2022 • Zhengxin Yang, Wanling Gao, Chunjie Luo, Lei Wang, Fei Tang, Xu Wen, Jianfeng Zhan

The study unveils a counterintuitive revelation: deep learning inference quality exhibits fluctuations due to inference time.

Autonomous Driving Benchmarking +1

Paper
Add Code

a cognitive frequency allocation strategy for multi-carrier radar against communication interference

no code implementations • 23 Dec 2022 • Zhao Shan, Lei Wang, PengFei Liu, Tianyao Huang, Yimin Liu

To address this challenge, we use a novel iteratively selecting technique which breaks a difficult decision task into several easy tasks.

Paper
Add Code

ToL: A Tensor of List-Based Unified Computation Model

no code implementations • 21 Dec 2022 • Hongxiao Li, Wanling Gao, Lei Wang, Jianfeng Zhan

This article presents a unified computation model with generalized expression ability and a concise set of primitive operators for programming high-level algorithms.

Paper
Add Code

Domain Generalization by Learning and Removing Domain-specific Features

1 code implementation • Advances in Neural Information Processing Systems 2022 • Yu Ding, Lei Wang, Bin Liang, Shuming Liang, Yang Wang, Fang Chen

With the images output by the encoder-decoder network, another classifier is designed to learn the domain-invariant features to conduct image classification.

Ranked #18 on Domain Generalization on PACS

Domain Generalization Image Classification

Paper
Code

Generalizing Math Word Problem Solvers via Solution Diversification

1 code implementation • 1 Dec 2022 • Zhenwen Liang, Jipeng Zhang, Lei Wang, Yan Wang, Jie Shao, Xiangliang Zhang

In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator.

Math

Paper
Code

Recent Advances in RecBole: Extensions with more Practical Considerations

1 code implementation • 28 Nov 2022 • Lanling Xu, Zhen Tian, Gaowei Zhang, Lei Wang, Junjie Zhang, Bowen Zheng, YiFan Li, Yupeng Hou, Xingyu Pan, Yushuo Chen, Wayne Xin Zhao, Xu Chen, Ji-Rong Wen

In order to show the recent update in RecBole, we write this technical report to introduce our latest improvements on RecBole.

3,166

Paper
Code

Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models

1 code implementation • 27 Nov 2022 • Lei Wang, Jiabang He, Xing Xu, Ning Liu, Hui Liu

In this paper, we propose a new model architecture with alignment-enriched tuning (dubbed AETNet) upon pre-trained document image models, to adapt downstream tasks with the joint task-specific supervised and alignment-aware contrastive objective.

Paper
Code

A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix

no code implementations • 26 Nov 2022 • Wenbin Li, Meihao Kong, Xuesong Yang, Lei Wang, Jing Huo, Yang Gao, Jiebo Luo

In this study, we present a new unified contrastive learning representation framework (named UniCLR) suitable for all the above four kinds of methods from a novel perspective of basic affinity matrix.

Contrastive Learning Representation Learning

Paper
Add Code

Explainable and Safe Reinforcement Learning for Autonomous Air Mobility

1 code implementation • 24 Nov 2022 • Lei Wang, Hongyu Yang, Yi Lin, Suwan Yin, Yuankai Wu

Although DRL has achieved important advancements in this field, the existing works pay little attention to the explainability and safety issues related to DRL controllers, particularly the safety under adversarial attacks.

Adversarial Attack Q-Learning +3

Paper
Code

A Review of Intelligent Music Generation Systems

no code implementations • 16 Nov 2022 • Lei Wang, Ziyi Zhao, Hanwei Liu, Junwei Pang, Yi Qin, Qidi Wu

With the introduction of ChatGPT, the public's perception of AI-generated content (AIGC) has begun to reshape.

Benchmarking Music Generation

Paper
Add Code

Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction

1 code implementation • 15 Nov 2022 • Leilei Gan, Baokui Li, Kun Kuang, Yating Zhang, Lei Wang, Luu Anh Tuan, Yi Yang, Fei Wu

Given the fact description text of a legal case, legal judgment prediction (LJP) aims to predict the case's charge, law article and penalty term.

Contrastive Learning

Paper
Code

Syntax-Guided Domain Adaptation for Aspect-based Sentiment Analysis

no code implementations • 10 Nov 2022 • Anguo Dong, Cuiyun Gao, Yan Jia, Qing Liao, Xuan Wang, Lei Wang, Jing Xiao

In this work, we propose a novel Syntax-guided Domain Adaptation Model, named SDAM, for more effective cross-domain ABSA.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Paper
Add Code

Mitigating Popularity Bias in Recommendation with Unbalanced Interactions: A Gradient Perspective

no code implementations • 31 Oct 2022 • Weijieying Ren, Lei Wang, Kunpeng Liu, Ruocheng Guo, Lim Ee Peng, Yanjie Fu

We present a gradient perspective to understand two negative impacts of popularity bias in recommendation model optimization: (i) the gradient direction of popular item embeddings is closer to that of positive interactions, and (ii) the magnitude of positive gradient for popular items are much greater than that of unpopular items.

Model Optimization Recommendation Systems

Paper
Add Code

A Case Study of Chinese Sentiment Analysis on Social Media Reviews Based on LSTM

no code implementations • 31 Oct 2022 • Lukai Wang, Lei Wang

Finally, sentiment analysis results were obtained by analyzing the comments with the LSTM model.

Chinese Sentiment Analysis Sentiment Analysis

Paper
Add Code

Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot Object Detection

1 code implementation • 30 Oct 2022 • Shan Zhang, Naila Murray, Lei Wang, Piotr Koniusz

To address these drawbacks, we propose a Time-rEversed diffusioN tEnsor Transformer (TENET), which i) forms high-order tensor representations that capture multi-way feature occurrences that are highly discriminative, and ii) uses a transformer that dynamically extracts correlations between the query image and the entire support set, instead of a single average-pooled support embedding.

Few-Shot Object Detection Object +1

Paper
Code

Temporal-Viewpoint Transportation Plan for Skeletal Few-shot Action Recognition

no code implementations • 30 Oct 2022 • Lei Wang, Piotr Koniusz

To factor out misalignment between query and support sequences of 3D body joints, we propose an advanced variant of Dynamic Time Warping which jointly models each smooth path between the query and support frames to achieve simultaneously the best alignment in the temporal and simulated camera viewpoint spaces for end-to-end learning under the limited few-shot training data.

Dynamic Time Warping Few-Shot action recognition +3

Paper
Add Code

Uncertainty-DTW for Time Series and Sequences

1 code implementation • 30 Oct 2022 • Lei Wang, Piotr Koniusz

Dynamic Time Warping (DTW) is used for matching pairs of sequences and celebrated in applications such as forecasting the evolution of time series, clustering time series or even matching sequence pairs in few-shot action recognition.

Dynamic Time Warping Few-Shot action recognition +3

Paper
Code

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

1 code implementation • 27 Oct 2022 • Bowen Shen, Zheng Lin, Yuanxin Liu, Zhengxiao Liu, Lei Wang, Weiping Wang

Motivated by such considerations, we propose a collaborative optimization for PLMs that integrates static model compression and dynamic inference acceleration.

Model Compression

Paper
Code

Recommendation with User Active Disclosing Willingness

no code implementations • 25 Oct 2022 • Lei Wang, Xu Chen, Quanyu Dai, Zhenhua Dong

Recommender system has been deployed in a large amount of real-world applications, profoundly influencing people's daily life and production. Traditional recommender models mostly collect as comprehensive as possible user behaviors for accurate preference estimation.

Recommendation Systems

Paper
Add Code

TPU-MLIR: A Compiler For TPU Using MLIR

1 code implementation • 23 Oct 2022 • Pengchao Hu, Man Lu, Lei Wang, Guoyue Jiang

Multi-level intermediate representations (MLIR) show great promise for reducing the cost of building domain-specific compilers by providing a reusable and extensible compiler infrastructure.

458

Paper
Code

S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention

1 code implementation • 22 Oct 2022 • Chiyu Zhang, Xiaogang Xu, Lei Wang, Zaiyan Dai, Jun Yang

Transformer's recent integration into style transfer leverages its proficiency in establishing long-range dependencies, albeit at the expense of attenuated local modeling.

Style Transfer

Paper
Code

Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages

no code implementations • 7 Oct 2022 • Lei Wang, Rong Tong, Cheung Chi Leung, Sunil Sivadas, Chongjia Ni, Bin Ma

This paper provides an overall introduction of our Automatic Speech Recognition (ASR) systems for Southeast Asian languages.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Synthetic Voice Detection and Audio Splicing Detection using SE-Res2Net-Conformer Architecture

no code implementations • 7 Oct 2022 • Lei Wang, Benedict Yeoh, Jun Wah Ng

Synthetic voice and splicing audio clips have been generated to spoof Internet users and artificial intelligence (AI) technologies such as voice authentication.

Binary Classification

Paper
Add Code

Pronunciation Modeling of Foreign Words for Mandarin ASR by Considering the Effect of Language Transfer

no code implementations • 7 Oct 2022 • Lei Wang, Rong Tong

This paper focuses on examining the phonetic effect of language transfer in automatic speech recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection

1 code implementation • Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2022 • Jun Yu, Liwen Zhang, Shenshen Du, Hao Chang, Keda Lu, Zhong Zhang, Ye Yu, Lei Wang, Qiang Ling

To overcome these difficulties, this paper first select fewer but suitable data augmentation methods to improve the accuracy of the supervised model based on the labeled training set, which is suitable for the characteristics of hyperspectral images.

Data Augmentation object-detection +3

Paper
Code

On Embeddings and Inverse Embeddings of Input Design for Regularized System Identification

no code implementations • 27 Sep 2022 • Biqiang Mu, Tianshi Chen, He Kong, Bo Jiang, Lei Wang, Junfeng Wu

For the emerging regularized system identification, the study on input design has just started, and it is often formulated as a non-convex optimization problem that minimizes a scalar measure of the Bayesian mean squared error matrix subject to certain constraints, and the state-of-art method is the so-called quadratic mapping and inverse embedding (QMIE) method, where a time domain inverse embedding (TDIE) is proposed to find the inverse of the quadratic mapping.

Paper
Add Code

GaitMM: Multi-Granularity Motion Sequence Learning for Gait Recognition

1 code implementation • 18 Sep 2022 • Lei Wang, Bo Liu, Bincheng Wang, Fuqiang Yu

In this study, we propose a multi-granularity motion representation network (GaitMM) for gait sequence learning.

Gait Recognition

Paper
Code

Deep Variational Free Energy Approach to Dense Hydrogen

1 code implementation • 13 Sep 2022 • Hao Xie, Zi-Hang Li, Han Wang, Linfeng Zhang, Lei Wang

We developed a deep generative model-based variational free energy approach to the equations of state of dense hydrogen.

Paper
Code

Explanation Guided Contrastive Learning for Sequential Recommendation

1 code implementation • 3 Sep 2022 • Lei Wang, Ee-Peng Lim, Zhiwei Liu, Tianxiang Zhao

Recently, contrastive learning has been applied to the sequential recommendation task to address data sparsity caused by users with few item interactions and items with few user adoptions.

Contrastive Learning Representation Learning +1

Paper
Code

Improving Compositional Generalization in Math Word Problem Solving

1 code implementation • 3 Sep 2022 • Yunshi Lan, Lei Wang, Jing Jiang, Ee-Peng Lim

To improve the compositional generalization in MWP solving, we propose an iterative data augmentation method that includes diverse compositional variation into training data and could collaborate with MWP methods.

Data Augmentation Math +1

Paper
Code

A Variance-Reduced Stochastic Gradient Tracking Algorithm for Decentralized Optimization with Orthogonality Constraints

no code implementations • 29 Aug 2022 • Lei Wang, Xin Liu

Decentralized optimization with orthogonality constraints is found widely in scientific computing and data science.

Autonomous Driving Riemannian optimization

Paper
Add Code

A Medical Semantic-Assisted Transformer for Radiographic Report Generation

no code implementations • 22 Aug 2022 • Zhanyu Wang, Mingkang Tang, Lei Wang, Xiu Li, Luping Zhou

Automated radiographic report generation is a challenging cross-domain task that aims to automatically generate accurate and semantic-coherence reports to describe medical images.

Image Captioning Medical Report Generation

Paper
Add Code

Private, Efficient, and Accurate: Protecting Models Trained by Multi-party Learning with Differential Privacy

no code implementations • 18 Aug 2022 • Wenqiang Ruan, Mingxin Xu, Wenjing Fang, Li Wang, Lei Wang, Weili Han

Second, to reduce the accuracy loss led by differential privacy noise and the huge communication overhead of MPL, we propose two optimization methods for the training process of MPL: (1) the data-independent feature extraction method, which aims to simplify the trained model structure; (2) the local data-based global model initialization method, which aims to speed up the convergence of the model training.

Paper
Add Code

Scalable and Sparsity-Aware Privacy-Preserving K-means Clustering with Application to Fraud Detection

no code implementations • 12 Aug 2022 • Yingting Liu, Chaochao Chen, Jamie Cui, Li Wang, Lei Wang

The second type is provable secure but is inefficient and even helpless for the large-scale data sparsity scenario.

Clustering Fraud Detection +1

Paper
Add Code

Instance Image Retrieval by Learning Purely From Within the Dataset

no code implementations • 12 Aug 2022 • Zhongyan Zhang, Lei Wang, Yang Wang, Luping Zhou, Jianjia Zhang, Peng Wang, Fang Chen

Although achieving promising results, this approach is restricted by two issues: 1) the domain gap between benchmark datasets and the dataset of a given retrieval task; 2) the required auxiliary dataset cannot be readily obtained.

Image Retrieval Retrieval +2

Paper
Add Code

RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning

3 code implementations • 9 Aug 2022 • Yue Duan, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi

In this work, we propose Reciprocal Distribution Alignment (RDA) to address semi-supervised learning (SSL), which is a hyperparameter-free framework that is independent of confidence threshold and works with both the matched (conventionally) and the mismatched class distributions.

Semi-Supervised Image Classification

Paper
Code

FactorVAE: A Probabilistic Dynamic Factor Model Based on Variational Autoencoder for Predicting Cross-Sectional Stock Returns

3 code implementations • AAAI Conference on Artificial Intelligence 2022 • Yitong Duan, Lei Wang, Qizhong Zhang, Jian Li

As an asset pricing model in economics and finance, factor model has been widely used in quantitative investment.

Stock Price Prediction

Paper
Code

Primitive Graph Learning for Unified Vector Mapping

no code implementations • 28 Jun 2022 • Lei Wang, Min Dai, Jianan He, Jingwei Huang, Mingwei Sun

Then, we convert vector shape prediction, regularization, and topology reconstruction into a unique primitive graph learning problem.

Graph Learning

Paper
Add Code

Attitude estimation from vector measurements: Necessary and sufficient conditions and convergent observer design

no code implementations • 27 Jun 2022 • Bowen Yi, Lei Wang, Ian R. Manchester

The paper addresses the problem of attitude estimation for rigid bodies using (possibly time-varying) vector measurements, for which we provide a necessary and sufficient condition of distinguishability.

Paper
Add Code

Constructing Cross-lingual Consumer Health Vocabulary with Word-Embedding from Comparable User Generated Content

no code implementations • 23 Jun 2022 • Chia-Hsuan Chang, Lei Wang, Christopher C. Yang

The experimental results demonstrate that our framework outperforms the other two large language models in identifying CHV across languages.

Paper
Add Code

TC-SfM: Robust Track-Community-Based Structure-from-Motion

no code implementations • 13 Jun 2022 • Lei Wang, Linlin Ge, Shan Luo, Zihan Yan, Zhaopeng Cui, Jieqing Feng

Specifically, a novel structure is proposed, namely, {\textit{track-community}}, in which each community consists of a group of tracks and represents a local segment in the scene.

Community Detection

Paper
Add Code

Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification

no code implementations • 31 May 2022 • Wenshuo Zhou, Dalu Yang, Binghong Wu, Yehui Yang, Junde Wu, Xiaorong Wang, Lei Wang, Haifeng Huang, Yanwu Xu

Deep learning based medical imaging classification models usually suffer from the domain shift problem, where the classification performance drops when training data and real-world data differ in imaging equipment manufacturer, image acquisition protocol, patient populations, etc.

domain classification Domain Generalization +3

Paper
Add Code

Fast and Arbitrary Beam Pattern Design for RIS-Assisted Terahertz Wireless Communication

no code implementations • 6 May 2022 • Jian Dang, Zaichen Zhang, Yewei Li, Liang Wu, Bingcheng Zhu, Lei Wang

Reconfigurable intelligent surface (RIS) can assist terahertz wireless communication to restore the fragile line-of-sight links and facilitate beam steering.

Paper
Add Code

Predicting Time-to-conversion for Dementia of Alzheimer's Type using Multi-modal Deep Survival Analysis

no code implementations • 2 May 2022 • Ghazal Mirabnahrazam, Da Ma, Cédric Beaulac, Sieun Lee, Karteek Popuri, Hyunwoo Lee, Jiguo Cao, James E Galvin, Lei Wang, Mirza Faisal Beg, the Alzheimer's Disease Neuroimaging Initiative

Combining MRI and genetic features improved survival prediction over using either modality alone, but adding CDC to any combination of features only worked as well as using only CDC features.

Survival Analysis Survival Prediction

Paper
Add Code

Revealing the CO2 emission reduction of ridesplitting and its determinants based on real-world data

no code implementations • 2 Apr 2022 • Wenxiang Li, Yuanyuan Li, Ziyuan Pu, Long Cheng, Lei Wang, Linchuan Yang

Integrating the trip data with the COPERT model, this study calculates the CO2 emissions of shared rides (ridesplitting) and their substituted single rides (regular ridesourcing) to estimate the CO2 emission reduction of each ridesplitting trip.

Interpretable Machine Learning

Paper
Add Code

MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization

3 code implementations • 27 Mar 2022 • Yue Duan, Zhen Zhao, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi, Yang Gao

The core issue in semi-supervised learning (SSL) lies in how to effectively leverage unlabeled data, whereas most existing methods tend to put a great emphasis on the utilization of high-confidence samples yet seldom fully explore the usage of low-confidence samples.

Ranked #1 on Semi-Supervised Image Classification on Mini-ImageNet, 1000 Labels

Semi-Supervised Image Classification

Paper
Code

Deep Transfer Learning with Graph Neural Network for Sensor-Based Human Activity Recognition

no code implementations • 14 Mar 2022 • Yan Yan, Tianzheng Liao, Jinjin Zhao, Jiahong Wang, Liang Ma, Wei Lv, Jing Xiong, Lei Wang

Given this observation, we devised a graph-inspired deep learning approach toward the sensor-based HAR tasks, which was further used to build a deep transfer learning model toward giving a tentative solution for these two challenging problems.

Few-Shot Learning Human Activity Recognition +1

Paper
Add Code

Topological EEG Nonlinear Dynamics Analysis for Emotion Recognition

no code implementations • 14 Mar 2022 • Yan Yan, Xuankun Wu, Chengdong Li, Yini He, Zhicheng Zhang, Huihui Li, Ang Li, Lei Wang

The proposed work is the first investigation in the emotion recognition oriented EEG topological feature analysis, which brought a novel insight into the brain neural system nonlinear dynamics analysis and feature extraction.

Arousal Estimation Dominance Estimation +6

Paper
Add Code

Machine Learning Based Multimodal Neuroimaging Genomics Dementia Score for Predicting Future Conversion to Alzheimer's Disease

no code implementations • 11 Mar 2022 • Ghazal Mirabnahrazam, Da Ma, Sieun Lee, Karteek Popuri, Hyunwoo Lee, Jiguo Cao, Lei Wang, James E Galvin, Mirza Faisal Beg, the Alzheimer's Disease Neuroimaging Initiative

Using a pre-defined 0. 5 threshold on DAT scores, we predicted whether or not a subject would develop DAT in the future.

Ensemble Learning feature selection

Paper
Add Code

Two-stream Hierarchical Similarity Reasoning for Image-text Matching

no code implementations • 10 Mar 2022 • Ran Chen, Hanli Wang, Lei Wang, Sam Kwong

Second, previous approaches only consider learning single-stream similarity alignment (i. e., image-to-text level or text-to-image level), which is inadequate to fully use similarity information for image-text matching.

Image-text matching Text Matching +1

Paper
Add Code

CenGCN: Centralized Convolutional Networks with Vertex Imbalance for Scale-Free Graphs

no code implementations • 16 Feb 2022 • Feng Xia, Lei Wang, Tao Tang, Xin Chen, Xiangjie Kong, Giles Oatley, Irwin King

In each non-output layer of the GCN, this framework uses a hub attention mechanism to assign new weights to connected non-hub vertices based on their common information with hub vertices.

Link Prediction

Paper
Add Code

Active and Passive Hybrid Detection Method for Power CPS False Data Injection Attacks with Improved AKF and GRU-CNN

no code implementations • 14 Feb 2022 • Zhaoyang Qu, Xiaoyong Bo, Tong Yu, Yaowei Liu, Yunchang Dong, Zhongfeng Kan, Lei Wang, Yang Li

Taking account of the fact that the existing knowledge-driven detection process for FDIAs has been in a passive detection state for a long time and ignores the advantages of data-driven active capture of features, an active and passive hybrid detection method for power CPS FDIAs with improved adaptive Kalman filter (AKF) and convolutional neural networks (CNN) is proposed in this paper.

Paper
Add Code

AD-NEGF: An End-to-End Differentiable Quantum Transport Simulator for Sensitivity Analysis and Inverse Problems

no code implementations • 10 Feb 2022 • Yingzhanghao Zhou, Xiang Chen, Peng Zhang, Jun Wang, Lei Wang, Hong Guo

Since proposed in the 70s, the Non-Equilibrium Green Function (NEGF) method has been recognized as a standard approach to quantum transport simulations.

Paper
Add Code

Graph Neural Network with Curriculum Learning for Imbalanced Node Classification

no code implementations • 5 Feb 2022 • Xiaohe Li, Lijie Wen, Yawen Deng, Fuli Feng, Xuming Hu, Lei Wang, Zide Fan

Graph Neural Network (GNN) is an emerging technique for graph-based learning tasks such as node classification.

Graph Classification imbalanced classification +2

Paper
Add Code

Self-consistent Gradient-like Eigen Decomposition in Solving Schrödinger Equations

no code implementations • 3 Feb 2022 • Xihan Li, Xiang Chen, Rasul Tutunov, Haitham Bou-Ammar, Lei Wang, Jun Wang

The Schr\"odinger equation is at the heart of modern quantum mechanics.

Paper
Add Code

A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification

no code implementations • 24 Jan 2022 • Lei Qi, Lei Wang, Yinghuan Shi, Xin Geng

Different from the conventional data augmentation, the proposed domain-aware mix-normalization to enhance the diversity of features during training from the normalization view of the neural network, which can effectively alleviate the model overfitting to the source domains, so as to boost the generalization capability of the model in the unseen domain.

Data Augmentation Person Re-Identification

Paper
Add Code

Indirect Adaptive Control of Nonlinearly Parameterized Nonlinear Dissipative Systems

no code implementations • 15 Jan 2022 • Romeo Ortega, Rafael Cisneros, Lei Wang, Arjan van der Schaft

In this note we address the problem of indirect adaptive (regulation or tracking) control of nonlinear, input affine dissipative systems.

Paper
Add Code

$m^\ast$ of two-dimensional electron gas: a neural canonical transformation study

1 code implementation • 10 Jan 2022 • Hao Xie, Linfeng Zhang, Lei Wang

The quasiparticle effective mass $m^\ast$ of interacting electrons is a fundamental quantity in the Fermi liquid theory.

Paper
Code

StyTr2: Image Style Transfer With Transformers

3 code implementations • CVPR 2022 • Yingying Deng, Fan Tang, WeiMing Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu

The goal of image style transfer is to render an image with artistic features guided by a style reference while maintaining the original content.

Style Transfer

318

Paper
Code

DC-SSL: Addressing Mismatched Class Distribution in Semi-Supervised Learning

no code implementations • CVPR 2022 • Zhen Zhao, Luping Zhou, Yue Duan, Lei Wang, Lei Qi, Yinghuan Shi

Consistency-based Semi-supervised learning (SSL) has achieved promising performance recently.

Pseudo Label

Paper
Add Code

Kernelized Few-Shot Object Detection With Efficient Integral Aggregation

no code implementations • CVPR 2022 • Shan Zhang, Lei Wang, Naila Murray, Piotr Koniusz

We design a Kernelized Few-shot Object Detector by leveraging kernelized matrices computed over multiple proposal regions, which yield expressive non-linear representations whose model complexity is learned on the fly.

Few-Shot Object Detection Object +2

Paper
Add Code

Decentralized Optimization Over the Stiefel Manifold by an Approximate Augmented Lagrangian Function

no code implementations • 30 Dec 2021 • Lei Wang, Xin Liu

In this paper, we focus on the decentralized optimization problem over the Stiefel manifold, which is defined on a connected network of $d$ agents.

Paper
Add Code

Integrating Quantum Processor Device and Control Optimization in a Gradient-based Framework

no code implementations • 23 Dec 2021 • Xiaotong Ni, Hui-Hai Zhao, Lei Wang, Feng Wu, Jianxin Chen

In a quantum processor, the device design and external controls together contribute to the quality of the target quantum operations.

Paper
Add Code

3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve

no code implementations • 23 Dec 2021 • Lei Wang, Jun Liu, Piotr Koniusz

In this paper, we propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE).

Dynamic Time Warping Few-Shot action recognition +3

Paper
Add Code

A Multi-View Framework for BGP Anomaly Detection via Graph Attention Network

no code implementations • 23 Dec 2021 • Songtao Peng, Jiaqi Nie, Xincheng Shu, Zhongyuan Ruan, Lei Wang, Yunxuan Sheng, Qi Xuan

As the default protocol for exchanging routing reachability information on the Internet, the abnormal behavior in traffic of Border Gateway Protocols (BGP) is closely related to Internet anomaly events.

Anomaly Detection feature selection +3

Paper
Add Code

Analysis and Evaluation of Kinect-based Action Recognition Algorithms

1 code implementation • 16 Dec 2021 • Lei Wang

Human action recognition still exists many challenging problems such as different viewpoints, occlusion, lighting conditions, human body size and the speed of action execution, although it has been widely used in different areas.

Action Recognition Temporal Action Localization

Paper
Code

Unsupervised Domain Generalization for Person Re-identification: A Domain-specific Adaptive Framework

1 code implementation • 30 Nov 2021 • Lei Qi, Jiaqi Liu, Lei Wang, Yinghuan Shi, Xin Geng

A significance of our work lies in that it shows the potential of unsupervised domain generalization for person ReID and sets a strong baseline for the further research on this topic.

Domain Generalization Person Re-Identification +1

Paper
Code

Learning Dynamic Compact Memory Embedding for Deformable Visual Object Tracking

no code implementations • 23 Nov 2021 • Pengfei Zhu, Hongtao Yu, Kaihua Zhang, Yu Wang, Shuai Zhao, Lei Wang, Tianzhu Zhang, QinGhua Hu

To address this issue, segmentation-based trackers have been proposed that employ per-pixel matching to improve the tracking performance of deformable objects effectively.

Segmentation Visual Object Tracking +1

Paper
Add Code

Block-Sparse Recovery Network for Two-Dimensional Harmonic Retrieval

no code implementations • 15 Nov 2021 • Rong Fu, Tianyao Huang, Lei Wang, Yimin Liu

As a typical signal processing problem, multidimensional harmonic retrieval (MHR) has been adapted to a wide range of applications in signal processing.

Retrieval Vocal Bursts Valence Prediction

Paper
Add Code

A Novel Sample-efficient Deep Reinforcement Learning with Episodic Policy Transfer for PID-Based Control in Cardiac Catheterization Robots

no code implementations • 28 Oct 2021 • Olatunji Mumini Omisore, Toluwanimi Akinyemi, Wenke Duan, Wenjing Du, Lei Wang

Robotic catheterization is typically used for percutaneous coronary intervention procedures nowadays and it involves steering flexible endovascular tools to open up occlusion in the coronaries.

Paper
Add Code

R4: A Framework for Route Representation and Route Recommendation

no code implementations • 20 Oct 2021 • Ran Cheng, Chao Chen, Longfei Xu, Shen Li, Lei Wang, Hengbin Cui, Kaikui Liu, Xiaolong Li

For user representation, we utilize a series of historical navigation to extract user preference.

Attribute

Paper
Add Code

Graph Partner Neural Networks for Semi-Supervised Learning on Graphs

no code implementations • 18 Oct 2021 • Langzhang Liang, Cuiyun Gao, Shiyi Chen, Shishi Duan, Yu Pan, Junjin Zheng, Lei Wang, Zenglin Xu

Graph Convolutional Networks (GCNs) are powerful for processing graph-structured data and have achieved state-of-the-art performance in several tasks such as node classification, link prediction, and graph classification.

Graph Classification Link Prediction +1

Paper
Add Code

High-order Tensor Pooling with Attention for Action Recognition

no code implementations • 11 Oct 2021 • Lei Wang, Ke Sun, Piotr Koniusz

We aim at capturing high-order statistics of feature vectors formed by a neural network, and propose end-to-end second- and higher-order pooling to form a tensor descriptor.

Ranked #2 on Scene Recognition on YUP++ (using extra training data)

Action Recognition Scene Recognition +1

Paper
Add Code

Network Learning in Quadratic Games from Fictitious Plays

no code implementations • 29 Sep 2021 • Kemi Ding, Yijun Chen, Lei Wang, Xiaoqiang Ren, Guodong Shi

Next, in view of the inherent stability and sparsity constraints for the network interaction structure, we propose a stable and sparse system identification framework for learning the interaction graph from full player action observations.

Paper
Add Code

Distributed Zeroth-Order Optimization: Convergence Rates That Match Centralized Counterpart

no code implementations • 29 Sep 2021 • Deming Yuan, Lei Wang, Alexandre Proutiere, Guodong Shi

Zeroth-order optimization has become increasingly important in complex optimization and machine learning when cost functions are impossible to be described in closed analytical forms.

Paper
Add Code

Heterologous Normalization

no code implementations • 29 Sep 2021 • Chunjie Luo, Jianfeng Zhan, Lei Wang, Wanling Gao

Specifically, it calculates the mean like Batch Normalization to maintain the advantage of Batch Normalization.

Paper
Add Code

Differential Privacy with Manifold Data Dependency

no code implementations • 29 Sep 2021 • Lei Wang, Deming Yuan, Guodong Shi

In this paper, we study dataset processing mechanisms generated by linear queries in the presence of manifold data dependency.

Paper
Add Code

A hierarchical residual network with compact triplet-center loss for sketch recognition

no code implementations • 28 Sep 2021 • Lei Wang, Shihui Zhang, Huan He, Xiaoxiao Zhang, Yu Sang

Last but not least, the compact triplet-center loss is proposed specifically for the sketch recognition task.

Sketch Recognition

Paper
Add Code

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

1 code implementation • Findings (EMNLP) 2021 • Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim

While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects.

Graph Question Answering Question Answering

Paper
Code

Progressive Hard-case Mining across Pyramid Levels for Object Detection

1 code implementation • 15 Sep 2021 • Binghong Wu, Yehui Yang, Dalu Yang, Junde Wu, Xiaorong Wang, Haifeng Huang, Lei Wang, Yanwu Xu

Based on focal loss with ATSS-R50, our approach achieves 40. 5 AP, surpassing the state-of-the-art QFL (Quality Focal Loss, 39. 9 AP) and VFL (Varifocal Loss, 40. 1 AP).

object-detection Object Detection

Paper
Code

LibFewShot: A Comprehensive Library for Few-shot Learning

1 code implementation • 10 Sep 2021 • Wenbin Li, Ziyi, Wang, Xuesong Yang, Chuanqi Dong, Pinzhuo Tian, Tiexin Qin, Jing Huo, Yinghuan Shi, Lei Wang, Yang Gao, Jiebo Luo

Furthermore, based on LibFewShot, we provide comprehensive evaluations on multiple benchmarks with various backbone architectures to evaluate common pitfalls and effects of different training tricks.

Data Augmentation Few-Shot Image Classification +2

803

Paper
Code

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

1 code implementation • 2 Sep 2021 • Yihuai Lan, Lei Wang, Qiyuan Zhang, Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim

Over the last few years, there are a growing number of datasets and deep learning-based methods proposed for effectively solving MWPs.

Ranked #8 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving

155

Paper
Code

Observability is Sufficient for the Design of Globally Exponentially Convergent State Observers for State-affine Nonlinear Systems

no code implementations • 21 Aug 2021 • Lei Wang, Romeo Ortega, Alexei Bobtsov

In this paper we are interested in the problem of state observation of state-affine nonlinear systems.

Paper
Add Code

Identifiability Implies Robust, Globally Exponentially Convergent On-line Parameter Estimation: Application to Model Reference Adaptive Control

no code implementations • 19 Aug 2021 • Lei Wang, Romeo Ortega, Alexey Bobtsov, Jose Guadalupe Romero, Bowen Yi

The estimators are shown to be robust to additive measurement noise and--not necessarily slow--parameter variations.

regression

Paper
Add Code

Improving Ranking Correlation of Supernet with Candidates Enhancement and Progressive Training

1 code implementation • 12 Aug 2021 • Ziwei Yang, Ruyi Zhang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

One-shot neural architecture search (NAS) applies weight-sharing supernet to reduce the unaffordable computation overhead of automated architecture designing.

Neural Architecture Search

Paper
Code

Cascade Bagging for Accuracy Prediction with Few Training Samples

1 code implementation • 12 Aug 2021 • Ruyi Zhang, Ziwei Yang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

To alleviate this problem, we propose a novel framework to train an accuracy predictor under few training samples.

Data Augmentation Ensemble Learning +1

Paper
Code

Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse Similarity Encoding

no code implementations • 6 Aug 2021 • Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang

Inspired by the recent local descriptor based few-shot learning (FSL), our general UDA model is fully built upon local descriptors (LDs) for image classification and domain adaptation.

Few-Shot Learning Image Classification +1

Paper
Add Code

MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving

1 code implementation • Findings (NAACL) 2022 • Zhenwen Liang, Jipeng Zhang, Lei Wang, Wei Qin, Yunshi Lan, Jie Shao, Xiangliang Zhang

Math word problem (MWP) solving faces a dilemma in number representation learning.

Ranked #5 on Math Word Problem Solving on MathQA

Common Sense Reasoning Language Modelling +4

Paper
Code

Trade When Opportunity Comes: Price Movement Forecasting via Locality-Aware Attention and Iterative Refinement Labeling

no code implementations • 26 Jul 2021 • Liang Zeng, Lei Wang, Hui Niu, Ruchen Zhang, Ling Wang, Jian Li

In a set of experiments on three real-world financial markets: stocks, cryptocurrencies, and ETFs, LARA significantly outperforms several machine learning based methods on the Qlib quantitative investment platform.

Metric Learning Time Series Analysis

Paper
Add Code

Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

1 code implementation • 24 Jul 2021 • Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi, Wuzhang Wang, Yang Gao

Together, the above two schemes give rise to a novel double-branch encoder segmentation framework for medical image segmentation, namely Crosslink-Net.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Hand Image Understanding via Deep Multi-Task Learning

1 code implementation • ICCV 2021 • Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu

To further improve the performance of these tasks, we propose a novel Hand Image Understanding (HIU) framework to extract comprehensive information of the hand object from a single RGB image, by jointly considering the relationships between these tasks.

3D Hand Pose Estimation Multi-Task Learning +1

Paper
Code

Trip-ROMA: Self-Supervised Learning with Triplets and Random Mappings

1 code implementation • 22 Jul 2021 • Wenbin Li, Xuesong Yang, Meihao Kong, Lei Wang, Jing Huo, Yang Gao, Jiebo Luo

However, in small data regimes, we can not obtain a sufficient number of negative pairs or effectively avoid the over-fitting problem when negatives are not used at all.

Representation Learning Self-Supervised Learning +1

Paper
Code

A Self-Boosting Framework for Automated Radiographic Report Generation

no code implementations • CVPR 2021 • Zhanyu Wang, Luping Zhou, Lei Wang, Xiu Li

On one hand, the image-text matching branch helps to learn highly text-correlated visual features for the report generation branch to output high quality reports.

Image Captioning Image-text matching +3

Paper
Add Code

Signal Acquisition of Luojia-1A Low Earth Orbit Navigation Augmentation System with Software Defined Receiver

no code implementations • 31 May 2021 • Liang Chen, Xiangchen Lu, Nan Shen, Lei Wang, Yuan Zhuang, Ye Su, Deren Li, Ruizhi Chen

The performance of those integration algorithms on expanding the successful acquisition time range is verified by the real data collected from the Luojia-1A satellite.

Paper
Add Code

StyTr$^2$: Image Style Transfer with Transformers

4 code implementations • 30 May 2021 • Yingying Deng, Fan Tang, WeiMing Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu

The goal of image style transfer is to render an image with artistic features guided by a style reference while maintaining the original content.

Style Transfer

318

Paper
Code

RFCBF: enhance the performance and stability of Fast Correlation-Based Filter

no code implementations • 30 May 2021 • Xiongshi Deng, Min Li, Lei Wang, Qikang Wan

Feature selection is a preprocessing step which plays a crucial role in the domain of machine learning and data mining.

feature selection

Paper
Add Code

Hybrid gene selection approach using XGBoost and multi-objective genetic algorithm for cancer classification

no code implementations • 30 May 2021 • Xiongshi Deng, Min Li, Shaobo Deng, Lei Wang

In the second stage, XGBoost-MOGA searches for an optimal gene subset based on the most relevant genes's group using a multi-objective optimization genetic algorithm.

feature selection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.