no code implementations • 18 Apr 2025 • Pin-Er Chen, Da-Chen Lian, Shu-Kai Hsieh, Sieh-Chuen Huang, Hsuan-Lei Shao, Jun-Wei Chiu, Yang-Hsien Lin, Zih-Ching Chen, Cheng-Kuang, Eddie TC Huang, Simon See
The recent advances in Legal Large Language Models (LLMs) have transformed the landscape of legal research and practice by automating tasks, enhancing research precision, and supporting complex decision-making processes.
no code implementations • 7 Apr 2025 • Zhiwei Cao, Minghao Li, Feng Lin, Jimin Jia, Yonggang Wen, Jianxiong Yin, Simon See
Our results demonstrate its superior performance over traditional time-consuming Computational Fluid Dynamics/Heat Transfer (CFD/HT) simulation, with a median absolute temperature prediction error of 0. 18 {\deg}C. This emerging approach would open doors to several potential research directions for advancing Physical AI in future DC operations.
no code implementations • 23 Mar 2025 • Xuesong Chen, Shaoshuai Shi, Tao Ma, Jingqiu Zhou, Simon See, Ka Chun Cheung, Hongsheng Li
In this paper, we introduce M3Net, a novel multimodal and multi-task network that simultaneously tackles detection, segmentation, and 3D occupancy prediction for autonomous driving and achieves superior performance than single task model.
no code implementations • 12 Mar 2025 • Dikai Liu, Tianwei Zhang, Jianxiong Yin, Simon See
Quadrupeds have gained rapid advancement in their capability of traversing across complex terrains.
no code implementations • 11 Mar 2025 • Jiawen Wei, Aniruddha Bora, Vivek Oommen, Chenyu Dong, Juntao Yang, Jeff Adie, Chen Chen, Simon See, George Karniadakis, Gianmarco Mengaldo
Extreme weather events are increasing in frequency and intensity due to climate change.
no code implementations • 18 Feb 2025 • Xin Wang, Juntao Yang, Jeff Adie, Simon See, Kalli Furtado, Chen Chen, Troy Arcomano, Romit Maulik, Gianmarco Mengaldo
In this work, we find that water vapor oversaturation during condensation is a key factor compromising the stability of hybrid models.
no code implementations • 16 Feb 2025 • Tianshi Zheng, Jiayang Cheng, Chunyang Li, Haochen Shi, ZiHao Wang, Jiaxin Bai, Yangqiu Song, Ginny Y. Wong, Simon See
Modern large language models (LLMs) employ various forms of logical inference, both implicitly and explicitly, when addressing reasoning tasks.
no code implementations • 13 Feb 2025 • Adjovi Sim, Zhengkui Wang, Aik Beng Ng, Shalini De Mello, Simon See, Wonmin Byeon
Online continual learning for image classification is crucial for models to adapt to new data while retaining knowledge of previously learned tasks.
no code implementations • 12 Dec 2024 • Meng Shen, Yake Wei, Jianxiong Yin, Deepu Rajan, Di Hu, Simon See
Additionally, most AL methods seldom address multimodal data, highlighting a research gap in this field.
no code implementations • 31 Oct 2024 • Xiufeng Huang, RuiQi Li, Yiu-ming Cheung, Ka Chun Cheung, Simon See, Renjie Wan
3D Gaussian Splatting (3DGS) has become a crucial method for acquiring 3D assets.
no code implementations • 30 Oct 2024 • Qi Song, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan
Single-view 3D reconstruction methods like Triplane Gaussian Splatting (TGS) have enabled high-quality 3D model generation from just a single image input within seconds.
no code implementations • 21 Oct 2024 • Jeremy Stephen Gabriel Yee, Pai Chet Ng, Zhengkui Wang, Ian McLoughlin, Aik Beng Ng, Simon See
This paper presents a systematic review of the infrastructure requirements for deploying Large Language Models (LLMs) on-device within the context of small and medium-sized enterprises (SMEs), focusing on both hardware and software perspectives.
1 code implementation • 19 Oct 2024 • Siyuan Yan, Zhen Yu, Clare Primiero, Cristina Vico-Alonso, Zhonghua Wang, Litao Yang, Philipp Tschandl, Ming Hu, Lie Ju, Gin Tan, Vincent Tang, Aik Beng Ng, David Powell, Paul Bonnington, Simon See, Elisabetta Magnaterra, Peter Ferguson, Jennifer Nguyen, Pascale Guitera, Jose Banuls, Monika Janda, Victoria Mar, Harald Kittler, H. Peter Soyer, ZongYuan Ge
Diagnosing and treating skin diseases require advanced visual skills across domains and the ability to synthesize information from multiple imaging modalities.
1 code implementation • 5 Oct 2024 • Chunkit Chan, Cheng Jiayang, Xin Liu, Yauwai Yim, Yuxin Jiang, Zheye Deng, Haoran Li, Yangqiu Song, Ginny Y. Wong, Simon See
Debate is the process of exchanging viewpoints or convincing others on a particular issue.
no code implementations • 18 Jul 2024 • Xiufeng Huang, Ka Chun Cheung, Simon See, Renjie Wan
While approaches like CopyRNeRF have been introduced to embed binary messages into NeRF models as digital signatures for copyright protection, the process of recolorization can remove these binary messages.
no code implementations • 15 Jul 2024 • Xingzhi Zhou, Xin Dong, Chunhao Li, Yuning Bai, Yulong Xu, Ka Chun Cheung, Simon See, Xinpeng Song, Runshun Zhang, Xuezhong Zhou, Nevin L. Zhang
However, this task faces limitations due to the scarcity of high-quality clinical datasets and the complex relationship between symptoms and herbs.
no code implementations • 10 Jul 2024 • Qi Song, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan
Neural Radiance Fields (NeRFs) have become a key method for 3D scene representation.
no code implementations • 4 Jun 2024 • Zhengyi Kwan, Wei zhang, Zhengkui Wang, Aik Beng Ng, Simon See
In this paper, we propose NuNet, a transformer-based network designed for nutrition estimation that utilizes both RGB and depth information from food images.
1 code implementation • 22 May 2024 • Chen-Hao Chao, Chien Feng, Wei-Fang Sun, Cheng-Kuang Lee, Simon See, Chun-Yi Lee
This framework integrates the policy evaluation steps and the policy improvement steps, resulting in a single objective training process.
Ranked #1 on
Omniverse Isaac Gym
on FrankaCabinet
1 code implementation • 16 Feb 2024 • Zhaowei Wang, Wei Fan, Qing Zong, Hongming Zhang, Sehyun Choi, Tianqing Fang, Xin Liu, Yangqiu Song, Ginny Y. Wong, Simon See
Abstraction ability is crucial in human intelligence, which can also benefit various tasks in NLP study.
no code implementations • 29 Jan 2024 • Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the reference image's pixels.
no code implementations • 26 Jan 2024 • Xingzhi Zhou, Zhiliang Tian, Ka Chun Cheung, Simon See, Nevin L. Zhang
Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference.
no code implementations • CVPR 2024 • Sixing Yan, William K. Cheung, Ivor W. Tsang, Keith Chiu, Terence M. Tong, Ka Chun Cheung, Simon See
Automatic radiology report generation using deep learning models has been recently explored and found promising.
no code implementations • CVPR 2024 • Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu
Next we extract background context from the query image to modulate the support foreground feature thus eliminating the foreground feature misalignment caused by the different backgrounds.
1 code implementation • 8 Oct 2023 • Qing Zong, Zhaowei Wang, Baixuan Xu, Tianshi Zheng, Haochen Shi, Weiqi Wang, Yangqiu Song, Ginny Y. Wong, Simon See
A main goal of Argument Mining (AM) is to analyze an author's stance.
1 code implementation • 15 Sep 2023 • Chunkit Chan, Xin Liu, Tsz Ho Chan, Jiayang Cheng, Yangqiu Song, Ginny Wong, Simon See
However, the inter-sentential coherence and the model consistency have not been well exploited in the previous works on this task.
no code implementations • ICCV 2023 • Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu
We innovatively utilize Gabor filters as a powerful extractor to exploit texture features, motivated by the capability of Gabor filters in effectively capturing multi-frequency features and detailed local information.
1 code implementation • ICCV 2023 • Ziyuan Luo, Qing Guo, Ka Chun Cheung, Simon See, Renjie Wan
Neural Radiance Fields (NeRF) have the potential to be a major representation of media.
1 code implementation • 14 Jun 2023 • Meng Shen, Yizheng Huang, Jianxiong Yin, Heqing Zou, Deepu Rajan, Simon See
Our studies demonstrate that the proposed method achieves more balanced multimodal learning by avoiding greedy sample selection from the dominant modality.
1 code implementation • ICCV 2023 • Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li
3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots.
1 code implementation • 4 Jun 2023 • Wei-Fang Sun, Cheng-Kuang Lee, Simon See, Chun-Yi Lee
In fully cooperative multi-agent reinforcement learning (MARL) settings, environments are highly stochastic due to the partial observability of each agent and the continuously changing policies of other agents.
Ranked #1 on
SMAC
on SMAC 26m_vs_30m
1 code implementation • 9 May 2023 • Zhaowei Wang, Quyet V. Do, Hongming Zhang, Jiayao Zhang, Weiqi Wang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See
This paper proposes a new task to detect commonsense causation between two events in an event sequence (i. e., context), called contextualized commonsense causal reasoning.
1 code implementation • 6 May 2023 • ZiHao Wang, Weizhi Fei, Hang Yin, Yangqiu Song, Ginny Y. Wong, Simon See
In contrast to existing scoring functions motivated by local comparison or global transport, this work investigates the local and global trade-off with unbalanced optimal transport theory.
1 code implementation • 6 May 2023 • Chunkit Chan, Xin Liu, Jiayang Cheng, Zihan Li, Yangqiu Song, Ginny Y. Wong, Simon See
Implicit Discourse Relation Recognition (IDRR) is a sophisticated and challenging task to recognize the discourse relations between the arguments with the absence of discourse connectives.
no code implementations • CVPR 2023 • Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu
Continual Semantic Segmentation (CSS) extends static semantic segmentation by incrementally introducing new classes for training.
1 code implementation • ICCV 2023 • Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directional optical flows for the center frame in a three-frame manner.
1 code implementation • CVPR 2023 • Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance.
1 code implementation • 21 Jan 2023 • ZiHao Wang, Yangqiu Song, Ginny Y. Wong, Simon See
On top of the query graph, we propose the Logical Message Passing Neural Network (LMPNN) that connects the local one-hop inferences on atomic formulas to the global logical reasoning for complex query answering.
1 code implementation • 1 Jan 2023 • Huaizheng Zhang, Yuanming Li, Wencong Xiao, Yizheng Huang, Xing Di, Jianxiong Yin, Simon See, Yong Luo, Chiew Tong Lau, Yang You
The vision of this paper is to provide a more comprehensive and practical benchmark study for MIG in order to eliminate the need for tedious manual benchmarking and tuning efforts.
no code implementations • 17 Dec 2022 • Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz
Consequently, existing solutions based on the action recognition models are only suboptimal.
1 code implementation • 23 Nov 2022 • Xin He, Jiangchao Yao, Yuxin Wang, Zhenheng Tang, Ka Chu Cheung, Simon See, Bo Han, Xiaowen Chu
One-shot neural architecture search (NAS) substantially improves the search efficiency by training one supernet to estimate the performance of every possible child architecture (i. e., subnet).
Ranked #26 on
Neural Architecture Search
on NAS-Bench-201, CIFAR-10
1 code implementation • 7 Nov 2022 • Huiru Xiao, Xin Liu, Yangqiu Song, Ginny Y. Wong, Simon See
However, the performance of the hyperbolic KG embedding models for non-transitive relations is still unpromising, while the complex hyperbolic embeddings do not deal with multi-relations.
1 code implementation • 2 Nov 2022 • Jun Wang, Abhir Bhalerao, Terry Yin, Simon See, Yulan He
Radiology report generation (RRG) has gained increasing research attention because of its huge potential to mitigate medical resource shortages and aid the process of disease decision making by radiologists.
1 code implementation • 14 Oct 2022 • Tianqing Fang, Quyet V. Do, Hongming Zhang, Yangqiu Song, Ginny Y. Wong, Simon See
We propose PseudoReasoner, a semi-supervised learning framework for CSKB population that uses a teacher model pre-trained on CSKBs to provide pseudo labels on the unlabeled candidate dataset for a student model to learn from.
1 code implementation • 13 Oct 2022 • Zhaowei Wang, Hongming Zhang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See
In this paper, we propose a new task of sub-event generation for an unseen process to evaluate the understanding of the coherence of sub-event actions and objects.
no code implementations • 4 Jul 2022 • Sixing Yan, William K. Cheung, Keith Chiu, Terence M. Tong, Charles K. Cheung, Simon See
In this paper, we introduce a novel fined-grained knowledge graph structure called an attributed abnormality graph (ATAG).
1 code implementation • CVPR 2023 • Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li
In this study, we propose a simple yet effective framework for video restoration.
Ranked #2 on
Deblurring
on GoPro
(using extra training data)
no code implementations • 22 Jun 2022 • Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon See
In this report, we describe the technical details of our submission for the EPIC-Kitchen-100 action anticipation challenge.
1 code implementation • 2 Jun 2022 • Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz
To this end, we propose a unified recurrence modeling for video action anticipation via message passing framework.
no code implementations • 18 Oct 2021 • Yi-Chen Chen, Shu-wen Yang, Cheng-Kuang Lee, Simon See, Hung-Yi Lee
It has been shown that an SSL pretraining model can achieve excellent performance in various downstream tasks of speech processing.
no code implementations • 26 Sep 2021 • Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Lihua Xie, Jianxiong Yin, Simon See
This paper introduces a novel self-supervised method that leverages incoherence detection for video representation learning.
no code implementations • 11 Jul 2021 • Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See
Yet correlation features of the same action would differ across domains due to domain shift.
no code implementations • 6 Jun 2021 • Hyun Gon Ryu, Jeong-Hoon Kim, Simon See
The proposed method is expected to adapt for researching on neural network models capable of synthesizing speech at the studio recording level.
no code implementations • 26 Aug 2020 • Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See
Temporal feature extraction is an essential technique in video-based action recognition.
no code implementations • 9 Jun 2020 • Yuecong Xu, Haozhi Cao, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See
Empirical results prove the effectiveness and efficiency of our PNL module, which achieves state-of-the-art performance of 83. 09% on the Mini-Kinetics dataset, with decreased computation cost compared to the non-local block.
1 code implementation • 6 Jun 2020 • Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See
We bridge the gap of the lack of data for this task by collecting a new dataset: the Action Recognition in the Dark (ARID) dataset.
no code implementations • 6 May 2020 • Yuecong Xu, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See
Temporal feature extraction is an important issue in video-based action recognition.
no code implementations • 8 Mar 2020 • Zhangsheng Lai, Aik Beng Ng, Liang Ze Wong, Simon See, Shaowei Lin
Reasoning over knowledge graphs is traditionally built upon a hierarchy of languages in the Semantic Web Stack.
1 code implementation • 20 Nov 2019 • Shaohuai Shi, Xiaowen Chu, Ka Chun Cheung, Simon See
Distributed stochastic gradient descent (SGD) algorithms are widely deployed in training large-scale deep learning models, while the communication overhead among workers becomes the new system bottleneck.
1 code implementation • 9 Jul 2019 • Qingyi Tao, ZongYuan Ge, Jianfei Cai, Jianxiong Yin, Simon See
Secondly, in CT scans, the lesions are often indistinguishable from the background since the lesion and non-lesion areas may have very similar appearances.
no code implementations • 10 Oct 2018 • Lei Ma, Felix Juefei-Xu, Minhui Xue, Qiang Hu, Sen Chen, Bo Li, Yang Liu, Jianjun Zhao, Jianxiong Yin, Simon See
Over the past decades, deep learning (DL) systems have achieved tremendous success and gained great popularity in various applications, such as intelligent machines, image processing, speech processing, and medical diagnostics.
no code implementations • 4 Sep 2018 • Xiaofei Xie, Lei Ma, Felix Juefei-Xu, Hongxu Chen, Minhui Xue, Bo Li, Yang Liu, Jianjun Zhao, Jianxiong Yin, Simon See
In company with the data explosion over the past decade, deep neural network (DNN) based software has experienced unprecedented leap and is becoming the key driving force of many novel industrial applications, including many safety-critical scenarios such as autonomous driving.
1 code implementation • CVPR 2018 • Jason Kuen, Xiangfei Kong, Zhe Lin, Gang Wang, Jianxiong Yin, Simon See, Yap-Peng Tan
We propose a novel approach for cost-adjustable inference in CNNs - Stochastic Downsampling Point (SDPoint).
1 code implementation • 4 Jun 2017 • Xinyu Fu, Eugene Ch'ng, Uwe Aickelin, Simon See
We observe that MGU achieves the optimal runtime and comparable performance against GRU and LSTM.
no code implementations • 3 Aug 2016 • Jinghua Wang, Zhenhua Wang, DaCheng Tao, Simon See, Gang Wang
In this paper, we tackle the problem of RGB-D semantic segmentation of indoor images.