no code implementations • COLING 2022 • Haoran Sun, Deyi Xiong
Knowledge transfer across languages is crucial for multilingual neural machine translation.
no code implementations • ICML 2020 • Haoran Sun, Songtao Lu, Mingyi Hong
Similarly, for online problems, the proposed method achieves an $\mathcal{O}(m \epsilon^{-3/2})$ sample complexity and an $\mathcal{O}(\epsilon^{-1})$ communication complexity, while the best existing bounds are $\mathcal{O}(m\epsilon^{-2})$ and $\mathcal{O}(\epsilon^{-2})$.
no code implementations • 3 Dec 2024 • Kai Sun, Siyan Xue, Fuchun Sun, Haoran Sun, Yu Luo, Ling Wang, Siyuan Wang, Na Guo, Lei Liu, Tian Zhao, Xinzhou Wang, Lei Yang, Shuo Jin, Jun Yan, Jiahong Dong
Recent advancements in deep learning have significantly revolutionized the field of clinical diagnosis and treatment, offering novel approaches to improve diagnostic precision and treatment efficacy across diverse clinical domains, thus driving the pursuit of precision medicine.
1 code implementation • 17 Nov 2024 • Shaolin Zhu, Supryadi, Shaoyang Xu, Haoran Sun, Leiyu Pan, Menglong Cui, Jiangcun Du, Renren Jin, António Branco, Deyi Xiong
An important focus of this survey is on the evaluation of MLLMs.
no code implementations • 7 Nov 2024 • Haoran Sun, Dominique Fourer, Hichem Maaref
Then, a model-based inversion is completed to restore the original audio signal.
no code implementations • 28 Oct 2024 • Shuchang Yan, Haoran Sun
We assume accurate observation of battery state of charge (SOC) and precise speed curves when addressing the constrained optimal fuel consumption (COFC) problem via constrained reinforcement learning (CRL).
no code implementations • 6 Oct 2024 • Yijiang Li, Qingying Gao, Haoran Sun, Haiyun Lyu, Dezhi Luo, Hokin Deng
To this end, we propose CogDevelop2K, a comprehensive benchmark that spans 12 sub-concepts from primitive knowledge like object permanence and boundary to more complex abilities like intentionality understanding, structured via the developmental trajectory of a human mind.
no code implementations • 3 Oct 2024 • Yekun Chai, Haoran Sun, Huang Fang, Shuohuan Wang, Yu Sun, Hua Wu
However, token-level RLHF suffers from the credit assignment problem over long sequences, where delayed rewards make it challenging for the model to discern which actions contributed to successful outcomes.
no code implementations • 1 Oct 2024 • Haoran Sun, Qingying Gao, Haiyun Lyu, Dezhi Luo, Hokin Deng, Yijiang Li
Mechanical reasoning is a fundamental ability that sets human intelligence apart from other animal intelligence.
no code implementations • 1 Oct 2024 • Qingying Gao, Yijiang Li, Haiyun Lyu, Haoran Sun, Dezhi Luo, Hokin Deng
Knowing others' intentions and taking others' perspectives are two core components of human intelligence that are typically considered to be instantiations of theory-of-mind.
no code implementations • 1 Oct 2024 • Dezhi Luo, Haiyun Lyu, Qingying Gao, Haoran Sun, Yijiang Li, Hokin Deng
Conservation is a critical milestone of cognitive development considered to be supported by both the understanding of quantitative concepts and the reversibility of mental operations.
1 code implementation • 12 Aug 2024 • Haoran Sun, Renren Jin, Shaoyang Xu, Leiyu Pan, Supryadi, Menglong Cui, Jiangcun Du, Yikun Lei, Lei Yang, Ling Shi, Juesi Xiao, Shaolin Zhu, Deyi Xiong
To mitigate this challenge, we present FuxiTranyu, an open-source multilingual LLM, which is designed to satisfy the need of the research community for balanced and high-performing multilingual capabilities.
no code implementations • 22 Jul 2024 • Jixiang Chen, Yiqun Lin, Haoran Sun, Xiaomeng Li
Although various deep learning models have been proposed to address this complex task, they suffer from two limitations: 1) They employ voxel representation for bone shape and exploit 3D convolutional layers to capture anatomy prior, which are memory-intensive and limit the reconstruction resolution.
no code implementations • 15 May 2024 • Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen
Furthermore, our theoretical analysis of gradient-based learning dynamics reveals that LLMs can learn both the adjacency and a limited form of the reachability matrices.
1 code implementation • 3 Apr 2024 • Haoran Sun, Lixin Liu, Junjie Li, Fengyu Wang, Baohua Dong, Ran Lin, Ruohui Huang
To address this challenge, we introduce Conifer, a novel instruction tuning dataset, designed to enhance LLMs to follow multi-level instructions with complex constraints.
no code implementations • 19 Feb 2024 • Zhijian Duan, Haoran Sun, Yichong Xia, Siqiang Wang, Zhilin Zhang, Chuan Yu, Jian Xu, Bo Zheng, Xiaotie Deng
Identifying high-revenue mechanisms that are both dominant strategy incentive compatible (DSIC) and individually rational (IR) is a fundamental challenge in auction design.
1 code implementation • 31 Oct 2023 • Haoran Sun, Xiaohu Zhao, Yikun Lei, Shaolin Zhu, Deyi Xiong
In this paper, we employ Singular Value Canonical Correlation Analysis (SVCCA) to analyze representations learnt in a multilingual end-to-end speech translation model trained over 22 languages.
no code implementations • 13 Jun 2023 • Yurong Chen, Qian Wang, Zhijian Duan, Haoran Sun, Zhaohua Chen, Xiang Yan, Xiaotie Deng
To the best of our knowledge, we are the first to consider bidder coordination in online repeated auctions with constraints.
2 code implementations • NeurIPS 2023 • Zhijian Duan, Haoran Sun, Yurong Chen, Xiaotie Deng
AMenuNet is always DSIC and individually rational (IR) due to the properties of AMAs, and it enhances scalability by generating candidate allocations through a neural network.
no code implementations • 19 May 2023 • Shun Zhang, Haoran Sun, Runze Yu, Hongshenyuan Cui, Jian Ren, Feifei Gao, Shi Jin, Hongxiang Xie, Hao Wang
In particular, we adopt a self-developed broadband intelligent communication system 40MHz-Net (BICT-40N) terminal in order to fully acquire the channel information.
1 code implementation • 17 Feb 2023 • Haoran Sun, Yang Wang, Haipeng Liu, Biao Qian
The proposed FF-Block integrates an attention block and several convolution layers to effectively fuse the fine-grained word-context features into the corresponding visual features, in which the text information is fully used to refine the initial image with more details.
no code implementations • 30 Nov 2022 • Haoran Sun, Lijun Yu, Bo Dai, Dale Schuurmans, Hanjun Dai
Score-based modeling through stochastic differential equations (SDEs) has provided a new perspective on diffusion models, and demonstrated superior performance on continuous data.
1 code implementation • 16 Sep 2022 • Haoran Sun, Hanjun Dai, Dale Schuurmans
Optimal scaling has been well studied for Metropolis-Hastings (M-H) algorithms in continuous spaces, but a similar understanding has been lacking in discrete spaces.
no code implementations • 23 Jul 2022 • Haoran Sun, Etash K. Guha, Hanjun Dai
However, learning neural networks for CO problems is notoriously difficult in lack of the labeled data as the training is easily trapped at local optima.
no code implementations • 29 Jun 2022 • Haoran Sun, Hanjun Dai, Bo Dai, Haomin Zhou, Dale Schuurmans
It is known that gradient-based MCMC samplers for continuous spaces, such as Langevin Monte Carlo (LMC), can be derived as particle versions of a gradient flow that minimizes KL divergence on a Wasserstein manifold.
no code implementations • 28 Dec 2021 • Bingqing Song, Haoran Sun, Wenqiang Pu, Sijia Liu, Mingyi Hong
We then provide a series of theoretical results to further understand the properties of the two approaches.
no code implementations • NeurIPS 2021 • Xinshi Chen, Haoran Sun, Caleb Ellington, Eric Xing, Le Song
We consider the problem of discovering $K$ related Gaussian directed acyclic graphs (DAGs), where the involved graph structures share a consistent causal order and sparse unions of supports.
no code implementations • 18 Oct 2021 • Haoran Sun, Chen Chen, Lantian Li, Dong Wang
SpeechFlow is a powerful factorization model based on information bottleneck (IB), and its effectiveness has been reported by several studies.
no code implementations • ICLR 2022 • Xinshi Chen, Haoran Sun, Le Song
In this work, we propose PLISA (Provable Learning-based Iterative Sparse recovery Algorithm) to learn algorithms automatically from data.
no code implementations • ICLR 2022 • Haoran Sun, Hanjun Dai, Wei Xia, Arun Ramamurthy
Energy-based Model (EBM) offers a powerful approach for modeling discrete structure, but both inference and learning of EBM are hard as it involves sampling from discrete distributions.
1 code implementation • 3 May 2021 • Haoran Sun, Wenqiang Pu, Xiao Fu, Tsung-Hui Chang, Mingyi Hong
However, it is often challenging for these approaches to learn in a dynamic environment.
no code implementations • NeurIPS Workshop LMCA 2020 • Haoran Sun, Wenbo Chen, Hui Li, Le Song
Branch-and-Bound~(B\&B) is a general and widely used algorithm paradigm for solving Mixed Integer Programming~(MIP).
4 code implementations • 16 Nov 2020 • Haoran Sun, Wenqiang Pu, Minghe Zhu, Xiao Fu, Tsung-Hui Chang, Mingyi Hong
We propose to build the notion of continual learning (CL) into the modeling process of learning wireless systems, so that the learning model can incrementally adapt to the new episodes, {\it without forgetting} knowledge learned from the previous episodes.
no code implementations • 27 Oct 2020 • Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang
Various information factors are blended in speech signals, which forms the primary difficulty for most speech information processing tasks.
no code implementations • 20 Jun 2020 • Mingyi Hong, Siliang Zeng, Junyu Zhang, Haoran Sun
However, by constructing some counter-examples, we show that when certain local Lipschitz conditions (LLC) on the local function gradient $\nabla f_i$'s are not satisfied, most of the existing decentralized algorithms diverge, even if the global Lipschitz condition (GLC) is satisfied, where the sum function $f$ has Lipschitz gradient.
no code implementations • 15 Jan 2020 • Haoran Sun, Xueqing Liu, Xinyang Feng, Chen Liu, Nanyan Zhu, Sabrina J. Gjerswold-Selleck, Hong-Jian Wei, Pavan S. Upadhyayula, Angeliki Mela, Cheng-Chia Wu, Peter D. Canoll, Andrew F. Laine, J. Thomas Vaughan, Scott A. Small, Jia Guo
Together, these studies validate our hypothesis that a deep learning approach can potentially replace the need for GBCAs in brain MRI.
no code implementations • 29 Oct 2019 • Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang
Speech signals are complex composites of various information, including phonetic content, speaker traits, channel effect, etc.
no code implementations • 13 Oct 2019 • Haoran Sun, Songtao Lu, Mingyi Hong
Similarly, for online problems, the proposed method achieves an $\mathcal{O}(m \epsilon^{-3/2})$ sample complexity and an $\mathcal{O}(\epsilon^{-1})$ communication complexity, while the best existing bounds are $\mathcal{O}(m\epsilon^{-2})$ and $\mathcal{O}(\epsilon^{-2})$, respectively.
no code implementations • 21 Jul 2019 • Zhanzhan Zhao, Haoran Sun
This paper proposes an algorithm Alice having no access to the physics law of the environment, which is actually linear with stochastic noise, and learns to make decisions directly online without a training phase or a stable policy as initial input.
no code implementations • NeurIPS 2020 • Xiangyi Chen, Tiancong Chen, Haoran Sun, Zhiwei Steven Wu, Mingyi Hong
We show that these algorithms are non-convergent whenever there is some disparity between the expected median and mean over the local gradients.
no code implementations • 8 Jun 2018 • Xinye Cai, Haoran Sun, Chunyang Zhu, Zhenyu Li, Qingfu Zhang
In this paper, an evolutionary many-objective optimization algorithm based on corner solution search (MaOEA-CS) was proposed.