no code implementations • Findings (EMNLP) 2021 • Peiyang Liu, Xi Wang, Sen Wang, Wei Ye, Xiangyu Xi, Shikun Zhang
Current embedding-based large-scale retrieval models are trained with 0-1 hard labels that indicate whether a query is relevant to a document, ignoring rich information about the degree of relevance.
no code implementations • 25 Nov 2024 • Zuhao Liu, Aleksandar Yanev, Ahmad Mahmood, Ivan Nikolov, Saman Motamed, Wei-Shi Zheng, Xi Wang, Luc van Gool, Danda Pani Paudel
Advances in video generation have significantly improved the realism and quality of created scenes.
no code implementations • 18 Nov 2024 • Xinyang Li, Yi Zhang, Yi Xie, Jianfei Yang, Xi Wang, Hao Chen, Haixian Zhang
In this paper, we introduce GroupMIL, a novel framework inspired by the clinical practice of collective analysis, which models multiple slides as a single sample and organizes groups of patches and slides sequentially to capture cross-slide prognostic features.
no code implementations • 13 Nov 2024 • Farouq Sammour, Jia Xu, Xi Wang, Mo Hu, Zhenyu Zhang
Construction remains one of the most hazardous sectors.
no code implementations • 19 Oct 2024 • Xiaocheng Zhang, Xi Wang, Yifei Lu, Zhuangzhuang Ye, Jianing Wang, Mengjiao Bao, Peng Yan, Xiaohong Su
However, previous studies on explanation generation have shown several limitations, such as being confined to English scenarios, involving overly complex inference processes, and not fully unleashing the potential of the mutual feedback between veracity labels and explanation texts.
no code implementations • 18 Oct 2024 • Nefeli Andreou, Xi Wang, Victoria Fernández Abrevaya, Marie-Paule Cani, Yiorgos Chrysanthou, Vicky Kalogeiton
Here, we address this by combining latent diffusion with a realignment mechanism, producing a novel, semantically structured space that encodes the semantics of language.
no code implementations • 17 Oct 2024 • Xuezhi Xiang, Xi Wang, Lei Zhang, Denis Ombati, Himaloy Himu, XianTong Zhen
Scene flow estimation aims to generate the 3D motion field of points between two consecutive frames of point clouds, which has wide applications in various fields.
no code implementations • 14 Oct 2024 • Xi Wang, Liana Mikaelyan, Taketomo Isazawa, James Hensman
In this paper, we propose Knowledge Base augmented Language Model (KBLaM), a new method for augmenting Large Language Models (LLMs) with external knowledge.
no code implementations • 26 Sep 2024 • Timo Breuer, Christin Katharina Kreutz, Norbert Fuhr, Krisztian Balog, Philipp Schaer, Nolwenn Bernard, Ingo Frommholz, Marcel Gohsen, Kaixin Ji, Gareth J. F. Jones, Jüri Keller, Jiqun Liu, Martin Mladenov, Gabriella Pasi, Johanne Trippas, Xi Wang, Saber Zerhoudi, ChengXiang Zhai
This paper is a report of the Workshop on Simulations for Information Access (Sim4IA) at SIGIR 2024.
no code implementations • 24 Sep 2024 • Xi Wang, Tianxing Chen, Qiaojun Yu, Tianling Xu, Zanxin Chen, Yiting Fu, Cewu Lu, Yao Mu, Ping Luo
To address this limitation, we present a closed-loop pipeline integrating interactive perception with online axis estimation from segmented 3D point clouds.
no code implementations • 23 Sep 2024 • Junqing He, Liang Zhu, Rui Wang, Xi Wang, Reza Haffari, Jiaxing Zhang
Long-term memory is important for chatbots and dialogue systems (DS) to create consistent and human-like conversations, evidenced by numerous developed memory-augmented DS (MADS).
no code implementations • 23 Sep 2024 • Bingkun Yao, Ning Wang, Jie zhou, Xi Wang, Hong Gao, Zhe Jiang, Nan Guan
Bug localization in Verilog code is a crucial and time-consuming task during the verification of hardware design.
no code implementations • 16 Sep 2024 • Xi Wang, Xin Liu, Songming Zhu, Zhanwen Li, Lina Gao
We conducted comparative experiments with multiple update strategies for self-updating and identified an optimal approach.
no code implementations • 2 Sep 2024 • Jiahe Tian, Peng Chen, Cai Yu, Xiaomeng Fu, Xi Wang, Jiao Dai, Jizhong Han
The produced manipulation maps can serve as better supervision to enhance face forgery detectors.
no code implementations • 29 Aug 2024 • Hossein A. Rahmani, Xi Wang, Emine Yilmaz, Nick Craswell, Bhaskar Mitra, Paul Thomas
Large-scale test collections play a crucial role in Information Retrieval (IR) research.
no code implementations • 26 Aug 2024 • Maxwell McManus, Tenzin Rinchen, Annoy Dey, Sumanth Thota, Zhaoxi Zhang, Jiangqi Hu, Xi Wang, Mingyue Ji, Nicholas Mastronarde, Elizabeth Serena Bentley, Michael Medley, Zhangyu Guan
In this work, we present a new federation framework for UnionLabs, an innovative cloud-based resource-sharing infrastructure designed for next-generation (NextG) and Internet of Things (IoT) over-the-air (OTA) experiments.
1 code implementation • 16 Aug 2024 • Kaixiang Yang, Wenqi Shan, Xudong Li, Xuan Wang, Xikai Yang, Xi Wang, Pheng-Ann Heng, Qiang Li, Zhiwei Wang
Multi-modal brain tumor segmentation typically involves four magnetic resonance imaging (MRI) modalities, while incomplete modalities significantly degrade performance.
no code implementations • 14 Aug 2024 • Zihao Ren, Lei Wang, Xinlei Yi, Xi Wang, Deming Yuan, Tao Yang, Zhengguang Wu, Guodong Shi
In this paper, we demonstrate that effective information compression may occur over time or space during sequences of node communications in distributed algorithms, leading to the concept of spatio-temporal compressors.
no code implementations • 10 Aug 2024 • Kexin Ma, Ruochun Jin, Xi Wang, Huan Chen, Jing Ren, Yuhua Tang
Retrieval-Augmented Large Language Models (RALMs) have made significant strides in enhancing the accuracy of generated responses. However, existing research often overlooks the data quality issues within retrieval results, which are frequently caused by inaccurate vector-distance-based retrieval methods. We propose to boost the precision of RALMs' answers from a data quality perspective through the Context-Driven Index Trimming (CDIT) framework, where Context Matching Dependencies (CMDs) are employed as logical data quality rules to capture and regulate the consistency between retrieved contexts. Based on the semantic comprehension capabilities of Large Language Models (LLMs), CDIT can effectively identify and discard retrieval results that are inconsistent with the query context and further modify indexes in the database, thereby improving answer quality. Experiments demonstrate its effectiveness on challenging question-answering tasks. The flexibility of CDIT is also verified through its compatibility with various language models and indexing methods, which offers a promising approach to jointly bolster RALMs' data quality and retrieval precision.
no code implementations • 5 Aug 2024 • Ekaterina Khramtsova, Mahsa Baktashmotlagh, Guido Zuccon, Xi Wang, Mathieu Salzmann
In this work, we propose a source-free approach centred on uncertainty-based estimation, using a generative model for calibration in the absence of source data.
no code implementations • 31 Jul 2024 • Xi Wang, Procheta Sen, Ruizhe Li, Emine Yilmaz
Despite the success of integrating large language models into the development of conversational systems, many studies have shown the effectiveness of retrieving and augmenting external knowledge for informative responses.
1 code implementation • 21 Jul 2024 • Ning Wang, Bingkun Yao, Jie zhou, Xi Wang, Zhe Jiang, Nan Guan
Recent advancements in large language models (LLMs) have catalyzed significant interest in the automatic generation of Register-Transfer Level (RTL) code, particularly Verilog, from natural language instructions.
no code implementations • 18 Jul 2024 • Qiao Li, Xiaomeng Fu, Xi Wang, Jin Liu, Xingyu Gao, Jiao Dai, Jizhong Han
Therefore, in order to judge whether a specific image is utilized as a member of a model's training set, Membership Inference Attack (MIA) is proposed to serve as a tool for privacy protection.
1 code implementation • 1 Jul 2024 • Robin Courant, Nicolas Dufour, Xi Wang, Marc Christie, Vicky Kalogeiton
dataset, we propose a diffusion-based approach, named DIRECTOR, which generates complex camera trajectories from textual captions that describe the relation and synchronisation between the camera and characters.
Ranked #1 on 3D Generation on E.T. the Exceptional Trajectories
no code implementations • 28 Jun 2024 • Daiwei Zhang, Gengyan Li, Jiajie Li, Mickaël Bressieux, Otmar Hilliges, Marc Pollefeys, Luc van Gool, Xi Wang
Human activities are inherently complex, often involving numerous object interactions.
1 code implementation • 26 Jun 2024 • Dunyuan Xu, Xi Wang, Jingyang Zhang, Pheng-Ann Heng
To achieve this, we create the orientational gradient alignment to ensure memorizability on previous sites, and arbitrary gradient alignment to enhance generalizability on unseen sites.
no code implementations • 21 Jun 2024 • Zeyao Ma, Bohan Zhang, Jing Zhang, Jifan Yu, Xiaokang Zhang, Xiaohan Zhang, Sijia Luo, Xi Wang, Jie Tang
We introduce SpreadsheetBench, a challenging spreadsheet manipulation benchmark exclusively derived from real-world scenarios, designed to immerse current large language models (LLMs) in the actual workflow of spreadsheet users.
1 code implementation • 14 Jun 2024 • Ridouane Ghermi, Xi Wang, Vicky Kalogeiton, Ivan Laptev
Recent advances in vision-language models have significantly propelled video understanding.
Multiple Choice Question Answering (MCQA) Open-Ended Question Answering +1
no code implementations • 22 May 2024 • Xi Wang, Laurence Aitchison
This gives critical insights for how to set the weight decay in AdamW, and how the weight decay should scale with model and dataset size.
1 code implementation • 30 Apr 2024 • Cai Yu, Shan Jia, Xiaomeng Fu, Jin Liu, Jiahe Tian, Jiao Dai, Xi Wang, Siwei Lyu, Jizhong Han
With the rising prevalence of deepfakes, there is a growing interest in developing generalizable detection methods for various types of deepfakes.
no code implementations • CVPR 2024 • Markos Diomataris, Nikos Athanasiou, Omid Taheri, Xi Wang, Otmar Hilliges, Michael J. Black
To address this, we introduce WANDR, a data-driven model that takes an avatar's initial pose and a goal's 3D position and generates natural human motions that place the end effector (wrist) on the goal location.
no code implementations • 20 Apr 2024 • Xi Wang, Yichen Peng, Heng Fang, Haoran Xie, Xi Yang, Chuntao Li
Achieving this requires effectively decoupling key attributes within the input image data in order to obtain accurate representations.
no code implementations • 19 Apr 2024 • Xi Wang, Nicolas Dufour, Nefeli Andreou, Marie-Paule Cani, Victoria Fernandez Abrevaya, David Picard, Vicky Kalogeiton
Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models.
no code implementations • 10 Apr 2024 • Suleyman Ozdel, Yao Rong, Berat Mert Albaba, Yen-Ling Kuo, Xi Wang
We introduce the Gaze-guided Action Anticipation algorithm, which establishes a visual-semantic graph from the video input.
no code implementations • 10 Apr 2024 • Suleyman Ozdel, Yao Rong, Berat Mert Albaba, Yen-Ling Kuo, Xi Wang
Eye-tracking applications that utilize the human gaze in video understanding tasks have become increasingly important.
no code implementations • 4 Apr 2024 • Zixuan Yi, Xi Wang, Iadh Ounis
To account for and model possible noise in the users' interactions in graph neural recommenders, we propose a novel Diffusion Graph Transformer (DiffGT) model for top-k recommendation.
no code implementations • 3 Apr 2024 • Ata Çelen, Guo Han, Konrad Schindler, Luc van Gool, Iro Armeni, Anton Obukhov, Xi Wang
Interior design allows us to be who we are and live how we want - each design is as unique as our distinct personality.
no code implementations • CVPR 2024 • Yihua Cheng, Yaning Zhu, Zongji Wang, Hongquan Hao, Yongwei Liu, Shiqing Cheng, Xi Wang, Hyung Jin Chang
GazeDPTR shows state-of-the-art performance on the IVGaze dataset.
1 code implementation • 19 Mar 2024 • Xi Wang, Hongliang Dai, Shen Gao, Piji Li
In response to this research gap, we create a benchmark for the characteristic AI agents task, including dataset, techniques, and evaluation metrics.
no code implementations • 13 Mar 2024 • Xiaomeng Fu, Xi Wang, Qiao Li, Jin Liu, Jiao Dai, Jizhong Han
In this paper, we explore a novel perspective for the TMI task by leveraging the intrinsic generative priors within the diffusion model.
1 code implementation • 8 Mar 2024 • Shoujin Huang, GuanXiong Luo, Xi Wang, Ziran Chen, Yuwan Wang, Huaishui Yang, Pheng-Ann Heng, Lingyan Zhang, Mengye Lyu
In general, diffusion model-based MRI reconstruction methods incrementally remove artificially added noise while imposing data consistency to reconstruct the underlying images.
no code implementations • 29 Feb 2024 • Xi Wang, Laurence Aitchison
We propose a batch size invariant version of Adam, for use in large-scale, distributed settings, in which the mini-batch is divided into micro-batches which are distributed among worker nodes.
no code implementations • 28 Feb 2024 • Xi Wang, Xiaotong Zhao, Juncheng Wang, You Li, Qingjiang Shi
We then propose a joint beamforming and linear stream allocation algorithm, termed as RWMMSE-LSA, which yields closed-form updates with linear stream allocation complexity and is guaranteed to converge to the stationary points of the original joint optimization problem.
no code implementations • 23 Feb 2024 • Francis Engelmann, Ayca Takmaz, Jonas Schult, Elisabetta Fedele, Johanna Wald, Songyou Peng, Xi Wang, Or Litany, Siyu Tang, Federico Tombari, Marc Pollefeys, Leonidas Guibas, Hongbo Tian, Chunjie Wang, Xiaosheng Yan, Bingwen Wang, Xuanyang Zhang, Xiao Liu, Phuc Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham, Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby
This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023.
no code implementations • 21 Feb 2024 • Xikai Yang, Jian Wu, Xi Wang, Yuchen Yuan, Ning Li Wang, Pheng-Ann Heng
Extensive experiments on the Sequential fundus Images for Glaucoma Forecast (SIGF) dataset demonstrate the superiority of the proposed MST-former method, achieving an AUC of 98.6% for glaucoma forecasting.
1 code implementation • 16 Feb 2024 • Alberto Cabezas, Adrien Corenflos, Junpeng Lao, Rémi Louf, Antoine Carnec, Kaustubh Chaudhari, Reuben Cohn-Gordon, Jeremie Coullon, Wei Deng, Sam Duffield, Gerardo Durán-Martín, Marcin Elantkowski, Dan Foreman-Mackey, Michele Gregori, Carlos Iguaran, Ravin Kumar, Martin Lysy, Kevin Murphy, Juan Camilo Orduz, Karm Patel, Xi Wang, Rob Zinkov
BlackJAX is a library implementing sampling and variational inference algorithms commonly used in Bayesian computation.
1 code implementation • 8 Feb 2024 • Jerome Ramos, Hossen A. Rahmani, Xi Wang, Xiao Fu, Aldo Lipani
Given the recent advances in Large Language Models (LLMs), we investigate how a properly crafted prompt can be used to summarize a user's preferences from past reviews and recommend items based only on language-based preferences.
1 code implementation • 2 Feb 2024 • Hossein A. Rahmani, Xi Wang, Mohammad Aliannejadi, Mohammadmehdi Naghiaei, Emine Yilmaz
Clarifying questions are an integral component of modern information retrieval systems, directly impacting user satisfaction and overall system performance.
1 code implementation • 31 Jan 2024 • Xiaopeng Li, Shasha Li, Shezheng Song, Huijun Liu, Bin Ji, Xi Wang, Jun Ma, Jie Yu, Xiaodong Liu, Jing Wang, Weimin Zhang
In particular, local editing methods, which directly update model parameters, are more suitable for updating a small amount of knowledge.
no code implementations • 26 Jan 2024 • Xi Wang, Ruoqing Zhao, Hongliang Dai, Piji Li
Chinese Spelling Check (CSC) is a meaningful task in the area of Natural Language Processing (NLP) which aims at detecting spelling errors in Chinese texts and then correcting these errors.
no code implementations • 17 Jan 2024 • Dunyuan Xu, Xi Wang, Jinyue Cai, Pheng-Ann Heng
Brain tumor represents one of the most fatal cancers around the world, and is very common in children and the elderly.
no code implementations • 7 Jan 2024 • Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll
Modeling the interaction between humans and objects has been an emerging research direction in recent years.
no code implementations • CVPR 2024 • Xi Wang, Xu Yang, Jie Yin, Kun Wei, Cheng Deng
In this paper, we construct two parallel spaces simultaneously, 1) Sub-prototype space and 2) Reminiscence space, to learn robust representations while alleviating forgetfulness.
no code implementations • 26 Dec 2023 • Ruoqing Zhao, Xi Wang, Hongliang Dai, Pan Gao, Piji Li
Automated radiology report generation has the potential to improve radiology reporting and alleviate the workload of radiologists.
no code implementations • 19 Dec 2023 • Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng
Both objective and subjective evaluations demonstrate that our proposed method can effectively improve the naturalness and expressiveness of the synthesized speech in audiobook synthesis, especially for role and out-of-domain scenarios.
no code implementations • 13 Dec 2023 • M. Eren Akbiyik, Nedko Savov, Danda Pani Paudel, Nikola Popovic, Christian Vater, Otmar Hilliges, Luc van Gool, Xi Wang
In contrast, we focus on inferring the ego trajectory of a driver's vehicle using their gaze data.
no code implementations • 8 Dec 2023 • Xi Wang, Xueyang Fu, Peng-Tao Jiang, Jie Huang, Mi Zhou, Bo Li, Zheng-Jun Zha
The former facilitates channel-dependent degradation removal operation, allowing the network to tailor responses to various adverse weather types; the latter, by integrating Fourier's global properties into channel-independent content features, enhances network capacity for consistent global content reconstruction.
no code implementations • 29 Nov 2023 • Sanghwan Kim, Daoji Huang, Yongqin Xian, Otmar Hilliges, Luc van Gool, Xi Wang
Traditional methods heavily rely on representation learning that is trained on a large amount of video data.
no code implementations • 27 Nov 2023 • Xi Wang, Xianyao Ling, Tom Zhang, Xuecao Li, Shaolan Wang, Zhixing Li, Liang Zhang, Peng Gong
This study demonstrates the effectiveness and superiority of the joint fine-tuning method using Prefix and LoRA for ChatGLM in the urban renewal knowledge QA tasks.
no code implementations • 27 Nov 2023 • Siwei Liu, Xi Wang, Craig Macdonald, Iadh Ounis
We propose a novel recommendation model, the Social-aware Gaussian Pre-trained model (SGP), which encodes the user social relations and interaction data at the pre-training stage in a Graph Neural Network (GNN).
1 code implementation • 20 Nov 2023 • Nikola Popovic, Dimitrios Christodoulou, Danda Pani Paudel, Xi Wang, Luc van Gool
In this work, we propose to predict 3D eye gaze from weak supervision of eye semantic segmentation masks and direct supervision of a few 3D gaze vectors.
1 code implementation • 25 Oct 2023 • Xi Wang, Hossein A. Rahmani, Jiqun Liu, Emine Yilmaz
Conversational Recommendation System (CRS) is a rapidly growing research area that has gained significant attention alongside advancements in language modelling techniques.
no code implementations • 29 Sep 2023 • Xi Wang, Laurence Aitchison, Maja Rudolph
To address these issues, we propose an ensemble approach using Low-Rank Adapters (LoRA), a parameter-efficient fine-tuning technique.
no code implementations • 28 Sep 2023 • Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han
Other works construct a one-to-one mapping between audio signals and head motion sequences, introducing ambiguous correspondences into the mapping, since people can move their heads differently when speaking the same content.
no code implementations • 25 Sep 2023 • Zihao Hu, Guanghui Wang, Xi Wang, Andre Wibisono, Jacob Abernethy, Molei Tao
In the context of Euclidean space, it is established that the last-iterates of both the extragradient (EG) and past extragradient (PEG) methods converge to the solution of monotone variational inequality problems at a rate of $O\left(\frac{1}{\sqrt{T}}\right)$ (Cai et al., 2022).
no code implementations • 18 Sep 2023 • Ekaterina Khramtsova, Shengyao Zhuang, Mahsa Baktashmotlagh, Xi Wang, Guido Zuccon
We propose the new problem of choosing which dense retrieval model to use when searching on a new collection for which no labels are available, i.e., in a zero-shot setting.
no code implementations • 7 Sep 2023 • Brendan Walsh, Mark Hamilton, Greg Newby, Xi Wang, Serena Ruan, Sheng Zhao, Lei He, Shaofei Zhang, Eric Dettinger, William T. Freeman, Markus Weimer
In this work, we present a system that can automatically generate high-quality audiobooks from online e-books.
no code implementations • 7 Sep 2023 • Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton
BluNF provides a robust and user-friendly 2D blueprint, enabling intuitive scene editing.
no code implementations • 6 Sep 2023 • Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao
In this paper, we present MuLanTTS, the Microsoft end-to-end neural text-to-speech (TTS) system designed for the Blizzard Challenge 2023.
no code implementations • 31 Aug 2023 • Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han
Responsive listening head generation is an important task that aims to model face-to-face communication scenarios by generating a listener head video given a speaker video and a listener head image.
2 code implementations • 24 Aug 2023 • Adam X. Yang, Maxime Robeyns, Xi Wang, Laurence Aitchison
Low-rank adaptation (LoRA) has emerged as a new paradigm for cost-efficient fine-tuning of large language models (LLMs).
no code implementations • 9 Jul 2023 • Somin Park, Xi Wang, Carol C. Menassa, Vineet R. Kamat, Joyce Y. Chai
When introducing HRC in construction, it is critical to recognize the importance of teamwork and supervision in field construction and establish a natural and intuitive communication system for the human workers and robotic assistants.
no code implementations • 3 Jul 2023 • Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee
Experiments show that ContextSpeech significantly improves the voice quality and prosody expressiveness in paragraph reading with competitive model efficiency.
1 code implementation • 28 Jun 2023 • Daoji Huang, Otmar Hilliges, Luc van Gool, Xi Wang
We present Palm, a solution to the Long-Term Action Anticipation (LTA) task utilizing vision-language and large language models.
no code implementations • 16 Jun 2023 • Sean MacAvaney, Xi Wang
Model distillation has emerged as a prominent technique to improve neural search models.
1 code implementation • 2 Jun 2023 • Zhengxiang Shi, Xi Wang, Aldo Lipani
Session-based recommendation, which aims to predict the next item of users' interest as per an existing sequence interaction of items, has attracted growing applications of Contrastive Learning (CL) with improved user and item representations.
1 code implementation • 25 May 2023 • Hossein A. Rahmani, Xi Wang, Yue Feng, Qiang Zhang, Emine Yilmaz, Aldo Lipani
The ability to understand a user's underlying needs is critical for conversational systems, especially with limited input from users in a conversation.
1 code implementation • 9 May 2023 • Zhengxiang Shi, Jerome Ramos, To Eun Kim, Xi Wang, Hossein A. Rahmani, Aldo Lipani
We move towards this target with two sub-tasks, a classification task and a ranking task.
no code implementations • 9 May 2023 • Haldun Balim, Seonwook Park, Xi Wang, Xucong Zhang, Otmar Hilliges
In this paper, we propose a frame-to-gaze network that directly predicts both 3D gaze origin and 3D gaze direction from the raw frame out of the camera without any face or eye cropping.
no code implementations • 13 Apr 2023 • Luyang Luo, Xi Wang, Yi Lin, Xiaoqi Ma, Andong Tan, Ronald Chan, Varut Vardhanabhuti, Winnie CW Chu, Kwang-Ting Cheng, Hao Chen
Breast cancer has reached the highest incidence rate worldwide among all malignancies since 2020.
no code implementations • 31 Mar 2023 • Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han
Specifically, the head pose prediction module is designed to generate head pose sequences from the source face and driving audio.
1 code implementation • CVPR 2023 • Xi Wang, Robin Courant, Jinglei Shi, Eric Marchand, Marc Christie
This paper presents JAWS, an optimization-driven approach that achieves the robust transfer of visual cinematic features from a reference in-the-wild video clip to a newly generated clip.
1 code implementation • 15 Mar 2023 • Jinxiang Lai, Siqian Yang, Wenlong Wu, Tao Wu, Guannan Jiang, Xi Wang, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang
Then we derive two specific attention modules, named SpatialFormer Semantic Attention (SFSA) and SpatialFormer Target Attention (SFTA), to enhance the target object regions while reducing the background distraction.
1 code implementation • 2 Mar 2023 • Yinuo Zhang, Zhulin Tao, Xi Wang, Tongyue Wang
Therefore, we proposed a structure coherence-based multi-modal fact verification scheme to classify fake news.
no code implementations • 16 Feb 2023 • Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han
To solve the identity mismatch problem and achieve high-quality free pose control, we present One-shot Pose-controllable Talking head generation network (OPT).
no code implementations • 1 Feb 2023 • Xi Wang, Ramya Penta, Bhavya Sehgal, Dale Chen-Song
Falls have become more frequent in recent years and are especially harmful for senior citizens. Detecting falls has therefore become important, and several datasets and machine learning models related to fall detection have been introduced.
no code implementations • 24 Jan 2023 • Procheta Sen, Xi Wang, Ruiqing Xu, Emine Yilmaz
Search engines and conversational assistants are commonly used to help users complete their everyday tasks such as booking travel, cooking, etc.
no code implementations • CVPR 2024 • Razvan-George Pasca, Alexey Gavryushin, Muhammad Hamza, Yen-Ling Kuo, Kaichun Mo, Luc van Gool, Otmar Hilliges, Xi Wang
This task requires an understanding of the spatio-temporal context formed by past actions on objects, coined action context.
no code implementations • 21 Dec 2022 • Xi Wang, Weixi Cheng, Wenliang Jia
We propose a deep learning method based on a Generative Adversarial Network (GAN), with condition edges as a structural prior to assist the generation.
1 code implementation • CVPR 2023 • Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, Shalini De Mello, Hyung Jin Chang, Xucong Zhang, Otmar Hilliges
We propose GazeNeRF, a 3D-aware method for the task of gaze redirection.
no code implementations • 25 Nov 2022 • Hanze Dong, Xi Wang, Yong Lin, Tong Zhang
With the popularity of Stein variational gradient descent (SVGD), the focus of particle-based VI algorithms has been on the properties of functions in Reproducing Kernel Hilbert Space (RKHS) to approximate the gradient flow.
no code implementations • 23 Nov 2022 • Jiawei Zhan, Jun Liu, Wei Tang, Guannan Jiang, Xi Wang, Bin-Bin Gao, Tianliang Zhang, Wenlong Wu, Wei zhang, Chengjie Wang, Yuan Xie
This paper builds a unified framework to perform effective noisy-proposal suppression and to interact between global and local features for robust feature learning.
no code implementations • 2 Nov 2022 • Jinxiang Lai, Siqian Yang, Guannan Jiang, Xi Wang, Yuxi Li, Zihui Jia, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang
In this paper, for the first time, we investigate the contributions of different distance metrics, and propose an adaptive fusion scheme, bringing significant improvements in few-shot classification.
no code implementations • 2 Nov 2022 • Jian Wang, Xi Wang, Chaoqun Ma, Lei Kou
With the advent of the electric power big data era, semantic interoperability and interconnection of power data have received extensive attention.
1 code implementation • 22 Oct 2022 • Fanghua Ye, Xi Wang, Jie Huang, Shenghui Li, Samuel Stern, Emine Yilmaz
Experimental results demonstrate that all three schemes can achieve competitive performance.
1 code implementation • 13 Oct 2022 • Xi Wang, Tomas Geffner, Justin Domke
Black-box variational inference performance is sometimes hindered by the use of gradient estimators with high variance.
no code implementations • 10 Sep 2022 • Xi Wang, Wenjie Wang, Fuli Feng, Wenge Rong, Chuantao Yin, Zhang Xiong
Specifically, we find that: 1) item popularity is a confounder between the exposed items and users' post-click interactions, leading to the first unfairness; and 2) some hidden confounders (e.g., the reputation of item producers) affect both item popularity and quality, resulting in the second unfairness.
no code implementations • 6 Sep 2022 • Xi Wang, Gen Li, Yen-Ling Kuo, Muhammed Kocabas, Emre Aksan, Otmar Hilliges
We further qualitatively evaluate the effectiveness of our method on real images and demonstrate its generalizability towards interaction types and object categories.
1 code implementation • Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022) 2022 • Bin-Bin Gao, Xiaochen Chen, Zhongyi Huang, Congchong Nie, Jun Liu, Jinxiang Lai, Guannan Jiang, Xi Wang, Chengjie Wang
This paper focuses on few-shot object detection (FSOD) and instance segmentation (FSIS), which require a model to quickly adapt to novel classes with a few labeled instances.
Ranked #4 on Few-Shot Object Detection on MS-COCO (1-shot)
1 code implementation • 14 Jul 2022 • Ji Liu, daxiang dong, Xi Wang, An Qin, Xingjian Li, Patrick Valduriez, Dejing Dou, dianhai yu
Although more layers and more parameters generally improve the accuracy of the models, such big models generally have high computational complexity and require large amounts of memory, which exceeds the capacity of small devices for inference and incurs long training times.
no code implementations • 9 Jul 2022 • Ekaterina Khramtsova, Guido Zuccon, Xi Wang, Mahsa Baktashmotlagh
This paper performs a detailed analysis of the effectiveness of topological properties for image classification in various training scenarios, defined by: the number of training samples, the complexity of the training data and the complexity of the backbone network.
no code implementations • 25 Jun 2022 • Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie
In this paper, we propose a novel framework for learning style representation from abundant plain text in a self-supervised manner.
1 code implementation • 24 Jun 2022 • Xi Wang, Laurence Aitchison
We develop ShiftMatch, a new training-data-dependent likelihood for robustness to corruption in Bayesian neural networks (BNNs).
no code implementations • 1 Jun 2022 • Camilo Fosco, Emilie Josephs, Alex Andonian, Allen Lee, Xi Wang, Aude Oliva
Second, they allow us to generate novel "Deepfake Caricatures": transformations of the deepfake that exacerbate artifacts to improve human detection.
no code implementations • 16 May 2022 • Yang Liu, Xiaoqi Wang, Xi Wang, Zhen Wang, Jürgen Kurths
We assume that the state of a number of nodes in a network could be investigated if necessary, and study what configuration of those nodes could facilitate a better solution for the diffusion-source-localization (DSL) problem.
3 code implementations • 9 May 2022 • Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, YuanHao Yi, Lei He, Frank Soong, Tao Qin, Sheng Zhao, Tie-Yan Liu
In this paper, we answer these questions by first defining the human-level quality based on the statistical significance of subjective measure and introducing appropriate guidelines to judge it, and then developing a TTS system called NaturalSpeech that achieves human-level quality on a benchmark dataset.
Ranked #1 on Text-To-Speech Synthesis on LJSpeech (using extra training data)
1 code implementation • CVPR 2022 • Jiahao Xia, Weiwei Qu, Wenjian Huang, JianGuo Zhang, Xi Wang, Min Xu
The SLPT generates the representation of each single landmark from a local patch and aggregates them by an adaptive inherent relation based on the attention mechanism.
Ranked #2 on Face Alignment on COFW-68
3 code implementations • 23 Dec 2021 • Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
A unified framework named ERNIE 3.0 was recently proposed for pre-training large-scale knowledge-enhanced models and was used to train a model with 10 billion parameters.
1 code implementation • 21 Dec 2021 • Zhe Liu, Tengteng Huang, Bingling Li, Xiwu Chen, Xi Wang, Xiang Bai
Recently, fusing the LiDAR point cloud and camera image to improve the performance and robustness of 3D object detection has received more and more attention, as these two modalities naturally possess strong complementarity.
no code implementations • NeurIPS 2021 • Xi Wang, Zhipeng Tu, Yiguang Hong, Yingyi Wu, Guodong Shi
We consider online optimization over Riemannian manifolds, where a learner attempts to minimize a sequence of time-varying loss functions defined on Riemannian manifolds.
1 code implementation • 18 Nov 2021 • Xiang Bai, Hanchen Wang, Liya Ma, Yongchao Xu, Jiefeng Gan, Ziwei Fan, Fan Yang, Ke Ma, Jiehua Yang, Song Bai, Chang Shu, Xinyu Zou, Renhao Huang, Changzheng Zhang, Xiaowu Liu, Dandan Tu, Chuou Xu, Wenqing Zhang, Xi Wang, Anguo Chen, Yu Zeng, Dehua Yang, Ming-Wei Wang, Nagaraj Holalkere, Neil J. Halin, Ihab R. Kamel, Jia Wu, Xuehua Peng, Xiang Wang, Jianbo Shao, Pattanasak Mongkolwat, Jianjun Zhang, Weiyang Liu, Michael Roberts, Zhongzhao Teng, Lucian Beer, Lorena Escudero Sanchez, Evis Sala, Daniel Rubin, Adrian Weller, Joan Lasenby, Chuangsheng Zheng, Jianming Wang, Zhen Li, Carola-Bibiane Schönlieb, Tian Xia
Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses.
no code implementations • 20 Oct 2021 • Shibin Zhang, Hongyan Zhou, Pengcheng Zheng, Jinbo Wu, Liping Zhang, Zhongxu Li, Kai Huang, Xin Ou, Xi Wang
Monolithic integration of multiband (1.4 to 6.0 GHz) RF acoustic devices was successfully demonstrated within the same process flow by using the lithium niobate (LN) thin film on silicon carbide (LNOSiC) substrate.
no code implementations • 3 Sep 2021 • Xinwei He, Silin Cheng, Dingkang Liang, Song Bai, Xi Wang, Yingying Zhu
To investigate this, we propose a novel Locality-Aware Point-View Fusion Transformer (LATFormer) for 3D shape retrieval and classification.
1 code implementation • ICCV 2021 • Adrian Spurr, Aneesh Dahiya, Xi Wang, Xucong Zhang, Otmar Hilliges
Encouraged by the success of contrastive learning on image classification tasks, we propose a new self-supervised method for the structured regression task of 3D hand pose estimation.
Ranked #14 on 3D Hand Pose Estimation on FreiHAND
no code implementations • 8 Jun 2021 • Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He
Experimental results obtained by the Transformer TTS show that the proposed BERT can extract fine-grained, segment-level prosody, which is complementary to utterance-level prosody to improve the final prosody of the TTS speech.
no code implementations • NAACL 2021 • Peiyang Liu, Sen Wang, Xi Wang, Wei Ye, Shikun Zhang
The embedding-based large-scale query-document retrieval problem is a hot topic in the information retrieval (IR) field.
no code implementations • 19 May 2021 • Feng Zhang, Xi Wang, Honggao Cao
In this paper, we study the behavior of information ratio (IR) as determined by the fundamental law of active investment management.
no code implementations • 21 Apr 2021 • Luyang Luo, Hao Chen, Yongjie Xiao, Yanning Zhou, Xi Wang, Varut Vardhanabhuti, Mingxiang Wu, Chu Han, Zaiyi Liu, Xin Hao Benjamin Fang, Efstratios Tsougenis, Huangjing Lin, Pheng-Ann Heng
The models were also compared to radiologists on a subset of the internal testing set (n=496).
no code implementations • AABI Symposium 2022 • Xi Wang, Laurence Aitchison
In particular, aleatoric uncertainty signals a specific type of OOD point: one without a well-defined class-label, and our model of data curation gives a likelihood for these points, giving us a mechanism for conditioning on outlier points and thus performing principled Bayesian outlier exposure.
no code implementations • 5 Feb 2021 • Xi Wang, Iadh Ounis, Craig Macdonald
Furthermore, inspired by the users' information adoption framework, we integrate two loss functions and a negative sampling strategy into our proposed RPRM model, to ensure that the properties of reviews are correlated with the users' preferences.
no code implementations • ICCV 2021 • Xueyang Fu, Xi Wang, Aiping Liu, Junwei Han, Zheng-Jun Zha
Specifically, we design a variational model to formulate the image de-blocking problem and propose two prior terms for the image content and gradient, respectively.
no code implementations • 24 Sep 2020 • Jingda Guo, Dominic Carrillo, Sihai Tang, Qi Chen, Qing Yang, Song Fu, Xi Wang, Nannan Wang, Paparao Palacharla
To reduce the amount of transmitted data, feature-map-based fusion was recently proposed as a practical solution to cooperative 3D object detection by autonomous vehicles.
no code implementations • 21 Aug 2020 • Xi Wang, Zoya Bylinskii, Aaron Hertzmann, Robert Pepperell
It has long been hypothesized that perceptual ambiguities play an important role in aesthetic experience: a work with some ambiguity engages a viewer more than one that does not.
no code implementations • 14 Aug 2020 • Xi Wang, Iadh Ounis, Craig Macdonald
However, a classification model that learns to classify binary instances with incomplete positive labels while assuming all unlabelled data to be negative examples will often generate a biased classifier.
no code implementations • 6 Jun 2020 • Luyang Luo, Lequan Yu, Hao Chen, Quande Liu, Xi Wang, Jiaqi Xu, Pheng-Ann Heng
Recent research has demonstrated that a performance bottleneck exists in joint training on different CXR datasets, yet few efforts have been made to address this obstacle.
no code implementations • 26 Jul 2019 • Xi Wang, Hao Chen, Luyang Luo, An-ran Ran, Poemen P. Chan, Clement C. Tham, Carol Y. Cheung, Pheng-Ann Heng
In addition, the proposed multi-task learning network can explore the structure-function relationship from the OCT image and the visual field measurement simultaneously, which helps boost classification performance.
1 code implementation • 18 Jul 2019 • Yibin Zheng, Xi Wang, Lei He, Shifeng Pan, Frank K. Soong, Zhengqi Wen, Jian-Hua Tao
Experimental results show that our proposed methods, especially the second one (bidirectional decoder regularization), lead to a significant improvement in both robustness and overall naturalness, outperforming the baseline (the revised version of Tacotron2) with a MOS gap of 0.14 in a challenging test and achieving close to human quality (4.42 vs. 4.49 in MOS) on a general test.
no code implementations • 7 Jun 2019 • Luyang Luo, Hao Chen, Xi Wang, Qi Dou, Huangjin Lin, Juan Zhou, Gongjie Li, Pheng-Ann Heng
In this paper, we propose to identify breast tumors in MRI using a Cosine Margin Sigmoid Loss (CMSL) with deep learning (DL) and to localize possible cancer lesions using a COrrelation Attention Map (COAM) based on the learned features.
no code implementations • 12 Feb 2019 • Xi Wang, Albert Chern, Marc Alexa
The boundary of the pupil is fitted with an ellipse, and the Euclidean center of the ellipse in the image is taken as the center of the pupil.
no code implementations • 17 Aug 2017 • Jinyu Li, Michael L. Seltzer, Xi Wang, Rui Zhao, Yifan Gong
High-accuracy speech recognition requires a large amount of transcribed data for supervised training.
no code implementations • 4 Jul 2017 • Shaoxiang Chen, Xi Wang, Yongyi Tang, Xinpeng Chen, Zuxuan Wu, Yu-Gang Jiang
This paper introduces the system we developed for the Google Cloud & YouTube-8M Video Understanding Challenge, which can be considered a multi-label classification problem defined on top of the large-scale YouTube-8M Dataset.
no code implementations • 21 Sep 2015 • Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue, Jun Wang
A multi-stream framework is proposed to fully utilize the rich multimodal information in videos.
no code implementations • 8 Apr 2015 • Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang, Xiangyang Xue
In this paper, we conduct an in-depth study to investigate important implementation options that may affect the performance of deep nets on video classification.
1 code implementation • 7 Apr 2015 • Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue
In this paper, we propose a hybrid deep learning framework for video classification, which is able to model static spatial information, short-term motion, as well as long-term temporal clues in the videos.